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Chapter 1 

Introduction 



This thesis aims to investigate mathematically two of the most robust phe- 
nomena in modern physics. The first is the 'uncertainty principle' in quantum 
mechanics. It states that no measurement can be performed without disturbing 
the system. The second is the 'cquivariance principle' in classical field theory, 
which states that any transformation of space-time must also transform the 
fields. 

The word 'robust' should be understood in the sense of 'independent of the 
dynamics of the particular model'. A physical theory can usually be separated 
into a kinematic and a dynamical part. The kinematic part of a theory describes 
the basic physical objects in terms of mathematics. The dynamical part then 
prescribes the precise way in which these variables evolve. Our results will 
largely be independent of the dynamics, making them applicable to a wide 
range of different models. 

In the case of quantum mechanics for example, the kinematic part of the 
theory states that a system is described by a Hilbert space, observables by self- 
adjoint operators, states by vectors and transformations by unitary operators. 
This alone will suffice for us to prove quantitative versions of the uncertainty 
principle, as well as asymptotic optimality of our many-qubit state estimation 
scheme. The dynamics of the theory, embodied by a model-specific Hamilton 
operator, only come into play when we test these ideas in specific situations. 

In the case of classical field theory, the kinematic part of the theory en- 
tails that space-time is modelled by a smooth manifold, fields are modelled by 
sections of a bundle over this manifold, and transformations are described by 
bundle automorphisms. This alone will suffice for us to classify the relevant 
bundles. The dynamical part of the theory, in the form of a model-specific 
Lagrangian, will have no part in this. 

Although I would warmly recommend it, I am aware that the reader may 
not wish to read this thesis from cover to cover. Chapters 3 through 7 can 
be read independently, with perhaps an occasional glance at the 'background' 
chapter 2. We give a brief description of each chapter. 



2 



Chapter 1 



Chapter 2. Background We give a short introduction to quantum prob- 
ability theory and classical field theory, providing background and motivation 
for the rest of the thesis. 

Chapter 3. Information Transfer Implies State Collapse Wc prove 
that within the framework of quantum mechanics, a transfer of information 
from a system to the outside necessarily causes decoherence on that system. 
We also prove a quantitative version of this, linking the amount of decoherence 
to the quality of information transfer. Finally, we shed some light on the fact 
that coherence is very hard to observe in macroscopic systems. This chapter is 
an adapted version of [JM06], written together with Hans Maassen. 

Chapter 4. Unifying Decoherence and the Heisenberg Principle With- 
in the framework of q^iantuni probability theory, wc prove quantitative versions 
of the Heisenberg principle, the joint measurement theorem, the no-cloning 
theorem, and the collapse of the wave function. An important difference with 
chapter 3 is that all bounds will be in terms of the measurement procedure alone; 
they will not depend on the particular state of the measured system. This is 
a good thing because the state of the measured system is generally unknown- 
hcncc the need for measurement. A part of this chapter appeared earlier in 
[JanOS]. 

Chapter 5. Optimal Pointers for Joint Measurement We apply the 
joint measurement inequality of chapter 4 to the specific situation of a 2-level 
atom, i.e. a qubit, coupled to the quantized electromagnetic field. Information 
on fjx and az leaks into the field and affects its quadrature, which can then be 
probed using homodyne detection. The joint measurement inequality provides 
us with a sharp upper bound on the quahty of any simultaneous measurement 
of ax and CTz. Using quantum stochastic calculus, we investigate how close one 
can come to this bound using the above setup. Somewhat to our surprise, we 
found that it cannot be reached, although one can come as close as 5.6%. This 
chapter is an adapted version of [JB06], written together with Luc Bouten. 

Chapter 6. Optimal Estimation of Qubit States We propose an asymp- 
totically optimal scheme to estimate the state of n identically prepared qubits. 
The qubits are coupled to the quantized electromagnetic field for some period of 
time, during which continuous time measurements on the field are performed. 
The proof of optimality relies on the concept of local asymptotic normality for 
qubits: for large n, the ensemble of qubits becomes statistically equivalent to 
the tensor product of a classical Gaussian distribution and a Gaussian state on 
a quantum harmonic oscillator. The Gaussian states are well understood, which 
means that we can use the statistical correspondence to better understand the 
qubits. 

In a way, this is not unlike the classical problem of determining the bias of 
an unfair coin that is tossed n times. In this case, the n coin tosses become 
statistically equivalent to a single Gaussian in the limit of large n. One should 
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guard against taking this analogy too far though: in the quantum case, there is 
a difference between the act of handing someone n identical qubits to probe, and 
the act of handing someone n times a single qubit to probe. At least for mixed 
states, a joint measurement on an ensemble of n qubits performs strictly better 
than the combined information of n single qubit measurements. The whole is 
more than the sum of its parts, so to speak. This chapter is an adapted version 
of [GJK08], which was written together with Madalin Gu^a and Jonas Kahn. 

Chapter 7. Bundles with a Lift of Infinitesimal Diffeomorphisms A 

bundle is called 'natural' if its fibres transform in a local fashion under diffeo- 
morphisms of the base. Examples include tangent bundles, frame bundles, and 
most other bundles of a local geometric nature. We extend the notion of a 'nat- 
ural' bundle to that of an 'infinitesimally natural' one by requiring the fibres to 
transform under infinitesimal diffeomorphisms only. We classify exactly what 
is gained by this extension. Bundles of spin type are never natural, but it turns 
out that some of them are indeed natural in the above infinitesimal sense. In 
classical field theory, fields are sections of a bundle over space-time. Spin and 
energy of a field are well-defined precisely when the corresponding bundle is in- 
finitesimally natural. The fact that bundles that correspond to fermionic fields 
(such as quarks, electrons, etc.) are never natural shows that the generalization 
is really called for. 

Samenvatting We give a non-rigorous Dutch summary of the main results, 
containing no formulae and as many pictures as possible. If you don't have much 
experience with (this kind of) mathematics, and if you understand Dutch, then 
this may be a good place to start reading. 
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Chapter 2 

Background 



The chapters to come can be read independently, and are self-sufficient to a 
certain degree. Complete autarky however is a dream, and, when realized in 
a PhD thesis, probably a nightmare. From time to time then, the reader is 
referred elsewhere, asked to trust a statement bluntly placed before him, or 
simply assumed to 'know what is going on'. This is inevitable. The flip side of 
the progress of science is the overspecialization that comes with it. 

However, as details and applications become more and more complex, the 
foundations of science have a tendency to simplify with time. This makes it 
feasible for us to sketch the fundamentals of quantum probability theory and 
classical field theory in the next forty-odd pages. We aim to explain some of the 
more vital ingredients that will enter our reasoning, but the main goal here is 
to provide background and motivation for the problems that will be addressed 
later on. 

We will start by giving a short introduction to quantum mechanics, focusing 
first on the way in which the theory gives rise to probability measures, and then 
on matters of transformation and symmetry. An even shorter introduction to 
quantum probability theory will follow, with an emphasis on transformations of 
the theory. Everything will be done first in the finite dimensional setting, and 
only then in the general case. The purpose of this is to prevent the elegant, yet 
somewhat involved functional analytic side of the theory from overshadowing 
its simpler algebraic counterpart. 

An important tool for us will be the quantum stochastic calculus of Hudson 
and Parthasarathy. This is a generalization of the Ito stochastic calculus on 
Wiener space, which is essentially calculus with stochastic infinitesimal incre- 
ments. After a primer on Fock spaces and Weyl operators, we will spend some 
words on the Wiener measure and Ito calculus. We will then see how the field 
operators in a free quantum field theory form a quantum version of the stochas- 
tic infinitesimal increments of Ito calculus, giving rise to quantum stochastic 
calculus. 

The last part of this chapter will be a brief introduction to classical field 
theory from a differential geometric point of view. In a nutshell, this means 
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that we will consistently treat fields as sections of a fibre bundle. The emphasis 

is once again on transformation and symmetry. We formulate the requirement 
that fields should transform in a definite fashion under infinitesimal space-time 
transformations. The ramifications of this will become clear in chapter 7. 

As already stated, we mean to sketch only the most elemental aspects of the 
theory. For a more complete picture, at perhaps a more responsible pace, the 
reader is referred to [Par92], [Pra04], [FR04], [Ban04], and [Maa04], on which 
we based the following account. 

2.1 Quantum Mechanics 

In quantum mechanics, a system is modelled by a Hilbert space %. A state of 
the system is modelled by a ray 'Cijj in %, and observables are represented by 
self-adjoint, possibly unbounded operators A on "H. Transformations of a closed 
system are described by unitary (or aiitiunitary) operators U on %. and there 
is a distinguished continuous one-parameter group t i— !■ J7t of unitary operators 
that describes time evolution. 

Quantum mechanics can only be interpreted in terms of probabilities. There 
is a canonical map (C^, A) i-> that associates to each ray <Cip and self-adjoint 
operator A a probability measure on the spectrum of A. It is interpreted as 
follows. 

// the system is in state Ctp, then observation of an observable A yields an 
outcome X G S with probability P^ {S) . 

Everything else follows from these postulates. 

2.1.1 Probabilities and the Spectral Theorem 

We will first describe the map (Ctl), A) P^ in the special case of finite dimen- 
sional Hilbert space, and then in the general case. 

The finite dimensional case is simple but interesting. It is simple because a 
choice of orthonormal basis identifies H with C", reducing many of our problems 
to linear algebra. It is interesting because the results in chapter 3 and 4 are as 
nontrivial here as they are in the general case, and if the reader will forgive 
me my patronizing remark- it might be helpful to keep the finite dimensional 
case in mind when reading these chapters. 

Let us take the opportunity to fix some notation. All Hilbert spaces are over 
C unless specifically mentioned otherwise, and we will take the inner product 
( • I • ) to be linear on the right. The adjoint of A is denoted A^, and B{'H) 
denotes the algebra of bounded linear operators on H. For n-dimensional H, 
this is just the algebra of n x n-matrices over C. 

Hilbert Spaces of Finite Dimension 

An orthogonal projection E is in particular a self-adjoint operator. Because its 

spectrum is {0, 1}, it should be interpreted as an event. The outcome 1 means 
that the event occurs, the outcome means that it does not. 
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If E projects onto Hi C H, then we have the orthogonal decomposition 
% = 'H{)®'Hi. A unit vector ^Ij splits as -0 = -00 + "^i, with HV'olP + HV'ilP = 1- 
This leads us to define the probability distribution on {0, 1} by 

Pv,(o) = iivof , Pv,(i) = iiV'iir- 

This depends only on the ray Cf/', so that we have our map (CV',j4) i-^- P^ in 
the special case that A is a projection. 

We can play the same game if we split H into any number of mutually or- 
thogonal spaces. Such a decomposition determines a projection valued measure. 

Definition 2.1 Let be a finite set, and T-L a finite dimensional Hilbert space. 
Then a projection valued measure (or PVM) is a map E -.0, ^ ^{^) such that: 

- E{oj) is an orthogonal projection for all co G Cl. 

-Ifoj^ uj', then E{uj)E{uj') = 0. 

- EnE{u:) = l. 

If is the image of -E(w), it is clear that H = 0q'Hw. Accordingly, a 
unit vector tp decomposes as V = X^nV'w) with = {ip \ E{uj)tp) and 

I E{uj)t/j) = 1. This means that a unit vector ip gives rise to a measure 
P^ on defined by P^(w) = {ip \ E{uj)ip}. 

For any unit vector tjj, we will define the map : B{'H) C , called a pure 
state, by 

p^iA) = {^\A^). 

With this notation, we have 

p,poE. 

Note that = p^i if and only if ip' = cijj for some c G C^, so that a pure 

state corresponds precisely to a ray Cijj in the Hilbert space. We will identify 
the two, and think of the pure state space as either 

v{u) = {p^;i>en, \m = i} 

or as the space of rays 

v{n) = n- {oj/C" . 

We leave it up to the reader to decide whether V stands for 'pure' or 'projective'. 

The spectral theorem says that each self-adjoint operator A gives rise to a 
projection valued measure on its spectrum. 

Theorem 2.1 Let A be a self-adjoint operator on a finite dimensional Hilbert 
space T-L, and let Spec{A) be its spectrum of eigenvalues. Then there exists a 
unique projection valued measure Ea ■ Spec{A) — )• B{'H) such that 

A= Yl ^^^w • 

\eSpec{A) 
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The Ea{X) are the projections onto the eigenspaces Hx, and H = 0spec(A) 

is the eigenspace decomposition. 

The map {Cip, A) ¥^ is now defined as follows. 

- The ray Ctp is identified with the pure state : B{'H) — > C. 

- An observable A gives rise to a PVM Ea : Spec(A) B{H). 

- The measure on Spec(A) is given by = o E. 

We interpret P^(A) as the probabihty that if the system is in state C-^, a 
measurement of A yields the outcome A. 

Infinite Dimensional Hilbert Spaces 

Although there do exist physically relevant systems where the Hilbert space is 
finite dimensional, this is not typically the case. We adapt our framework to 
encompass general Hilbert spaces. 

We equip the algebra B{'H) of bounded operators on % with the weak topol- 
ogy, the coarsest topology that makes p,j}{A) = {ijj \ Aip) continuous for all ijj in 
T-L. We then introduce the notion of a projection valued measure for general 
Hilbert spaces as follows. 

Definition 2.2 Let (Jl, S) be a measurable space, and T-L a Hilbert space. Then 
a projection valued measure or PVM is a map E ^ B{%) with the following 
properties. 

- E{S) is a projection for all 5 e S. 

- E{$) = 0, and E{Q.) = 1. 

- IfSinS2 = 0, then E{Si)E{S2) = 0. 

- // Si is a sequence of disjoint sets in S, then E{[j°^Q Si) — J2iLo E{Si), 
where the limit is in the weak topology. 

Each ray Cip induces a probability measure P^ on (12, S) by applying the pure 
state p^{A) := {ip \ Aip) to E, i.e. 

¥^iS) = {p^ o E){S) . 

If / is a measurable function on O, then an integral 

lEif) := [ f{uj)E{d^) (2.1) 

can be defined by the weak limit of Riemann sums. The resulting operator 
Isif) on is bounded if / is bounded, which invites us to consider Ie as a 
linear map Ie : L^ {Cl, E) — > B{'H). It is a homomorpliism of *-algebras, i.e. it 
respects multiplication Isifg) — ^£;(/)-^£;(ff) and adjunction lE{f^) — Isif )^- 
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In particular, Isif) is self-adjoint if / is real, and positive if / is positive. With 
respect to P^, we have E(/) = 

Observables correspond to self-adjoint (i.e. symmetric, closed and possibly 
unbounded) operators on "H. The spectral theorem says that any self-adjoint 
operator gives rise to a projection valued measure on its spectrum, called the 
spectral measure. 

Theorem 2.2 (spectral theorem) Let A be a self-adjoint operator on H. 
Then there exists a unique PVM Ea ■ T,{R) — )• B(H) on the Borel sigma-algebra 
of M such that, with A the identity function on M., 



for all ijj in the domain of A. The PVM is concentrated on the spectrum of A. 



Any pure state therefore induces a probability measure on Spec(A) by 
Pj/) — PtP°Ea- Under this probability measure, the expectation of the observable 
vl of a system in a state is ]E(A) = p^(A). Its variance is p^{A^) — p^{Af'. 

2.1.2 Transformation, Symmetry and Conservation Laws 

Let us devote some attention to the way in which transformations and symme- 
tries occur in quantum mechanics, and then give examples of relevant quantum 

systems. Although the proper trcatineiit of syininctry in quantum inecliauics 
will not play a major role in this thesis, it will help us to properly interpret the 
examples. 

Hilbert Spaces of Finite Dimension 

As before, we will first focus attention on finite dimensional Hilbert spaces, 
before looking at the general case. Throughout, we will denote by U{^) the 
group of unitary operators on %. 

Transformations A unitary operator U G U{T-L) induces an automorphism 
au : A I— > W AU of B{T-L). This in turn acts on V{H) by its dual, alj : i— >■ 
PV ° '^u ■ The action of the unitary operators on the pure state space is therefore 
governed by the group of inner automorphisms VU(1-L) — {ajj ; U € U^H)}. 

Recall that the pure state space has the equivalent description ■p('H) = 
"H — {0}/C^, on which ajj acts by Ctp CUij:. This shows that ajj = ajj' 
if and only if U' = cU for some c G C ^ . We therefore have the equivalent 
description V{U{'H)) ~ U{7i)/C^. Apparently, the V stands for 'projective'. 

The action of an abstract Lie group G on a quantum mechanical system 
should therefore be described by a projective unitary representation, i.e. a 
continuous group homomorphism tt : G — > 'PC/('H). 




Proof. See for example [Par92]. 



□ 
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Recall that a linear unitary representation is a continuous group homo- 

morphism tt : G — > Every linear unitary representation gives rise to a 

projective unitary representation, but it is not true that every projective repre- 
sentation arises in tliis way. 

Since the theory of linear unitary representations is well understood, we 
would like to reduce projective representations to linear ones. This is achieved 
by the following theorem, at the expense of slightly enlarging the group. 

Theorem 2.3 Let G be a connected Lie group, with universal cover G. Let % he 

a finite dimensional Hilhert space. Then every projective unitary representation 
IT of G on T-L comes from a linear unitary representation tt of G on T-L, in the 
sense that the following diagram commutes. 

G — 

I 1 

G VU{H) 

Proof. Since T-L ~ C" for some finite n, we can consider tt as a homomorphism 
G V{U{C")). The derived Lie algebra homomorphism Lie(G) 5u(C") 
extends to a group homomorphism G SU(C") by Lie's second theorem. □ 

In what follows, we will assume that G has been replaced by the universal cover 
of its connected component of unity Go, and tt by a linear unitary representation 
of Go. 

Conservation Laws It is a remarkable fact of life that in every fundamental 
physical theory, infinitesimal symmetries correspond to conservation laws. In 
quantum mechanics, this correspondence is particularly straightforward. 

Infinitesimal transformations are elements of the Lie algebra Lie(G) of G. 
The following theorem describes how they give rise to observables. 

Theorem 2.4 Let n be a continuous unitary representation of G onH. Then 
for each X e Lie(G), we have a well defined skew- symmetric operator 7r(X) 
defined by 

^(X) = 7r(e-*^)|7r(e*^). (2.2) 

As 7r([X, y]) = [7t(X), 7r(y)], this defines a Lie algebra homomorphism tt of 
Lie(G) into the Lie algebra 5u{'H) of skew-symmetric operators on %. 

The operator i'k{X) is self-adjoint, and should be considered as the observable 
corresponding to the infinitesimal transformation X. We denote it J{X). If one 
has to pass to the universal cover, then tt and J are still well-defined, since the 
Lie algebras of G and Gq are the same. 

Time evolution is given by a one-parameter group of unitary transforma- 
tions, i.e. a unitary representation t^Utoi (K, +). Its Lie algebra is again M. 
and the self-adjoint operator J(l) is called the Hamiltonian, denoted H. The 
observable associated to this operator is the energy. 
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Equation (2.2) for the particular case of time evolution reads 

i^Ut = HUt . 
at 

In the 'Heisenberg picture', where U acts by au, one writes = U^AUt to 
obtain Heisenberg 's equation of motion 

j^A = i[H,A,]. (2.3) 

In the dual 'Schrodinger picture', where U acts by a^, one writes '^t = Utip to 
obtain the Schrodinger equation 

ij^i>t = Hi,t. (2.4) 

A transforniation is called a symmetry if it commutes with the time evo- 
lution t I-)- Ut, i.e. if \Ut^T^{g)] = for all i G M. A group G is a group of 
symmetries if -K{g) is a symmetry for every g G G. Because this implies that 
7r(X ) commutes with H, we immediately obtain from Heisenberg's equation of 
motion the conservation law 

If X is an infinitesimal symmetry, then the probability measure on the spectrum 
of J{X) is constant in time. For J{X) = H, this is called conservation of energy. 

The fundamental problem in quantum mechanics is to reconstruct the time 
evolution t^Ut from the given Hamiltonian H. This is usually done by finding 
the eigenspace decomposition T-L = ©spec(//) ^ e for , and then computing 
Ut = e~'*^. This is greatly facilitated by the presence of symmetries. 

The Qubit Time for an example. The internal degrees of freedom of an 
electron are described by the Hilbert space % = €?. It transforms under the 
rotation group S0(3,M) by a projective unitary representation. 

Consider the map PU{C'^) S0(3,M) defined as follows. In terms of the 
basis 

1 0\ fO 1\ fO -i\ (I 

ij' Vi oj' ''y = \i oj' ^^ = Vo -1 

of B(C^), one readily checks that a!7(Al) = Al, and that 

3 3 

<y-uC^Xiai) = ^ RijXjai 

1=1 ij=l 

for some R £ S0(3,]R). The map a„ i— >■ ii turns out to be an isomorphism 
PU (C^) ^ S0(3, R), and its inverse tt is the projective representation of S0(3, M) 
on C^. If we choose the basis 
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of 5o(3,K), then the Lie algebra homomorphism tt : so(3,]R) 5u(C^) gives 
rise to three observables J{tx) = \<yx, Ji^y) = ^^nd J{tz) = -^(Jz- They 
should therefore be considered as the internal angular momenta or spin of the 
electron. 

If the particle is placed in a vertically aligned magnetic field of strength 
B, then the effective Hamiltonian of the system is H = Baz/2. The group 
of symmetries is then not all of S0(3,R), but only the subgroup S0(2,M) of 
rotations around the z-axis. The only conserved quantity is therefore H ~ gcr^. 
Heisenberg's equation of motion i-^At = [H,At] can be solved to yield 

t H^- a(^cos{Bt + 4>)ax + s\n{Bt + 4>)cr^ + hcFz , 

which says that the angular inoinciituin rotates around the 2-axis with a fre- 
quency proportional to the magnetic field. In particular, the angular momentum 
in the 2;-direction is preserved, as was predicted by symmetry considerations. 

The system = is called a qubit, and one can use it to model any 
quantum system with only two (relevant) states, not just the electron. For 
example, in quantum optics, the interaction of an atom with a laser beam is 
often such that only two energy-eigenstates of the atom are affected. It is then 
suHicient to model this atom by a qubit C^, rather than by its full Hilbert space. 
We will make extensive use of this in chapters 5 and 6. 

Infinite DimensionEil Hilbert Spaces 

In infinite dimensional Hilbert space, the situation is similar, but slightly more 
involved. The only qualitative difference is that for some -but certainly not 
for all- Lie groups G, the universal cover G no longer suffices to describe all 
projective unitary representations, in which case a central deformation of the 
algebra of symmetries occurs. 

Transformations A Lie group G of transformations acts on "H by a projective 
unitary representation. This is a continuous group homomorphism G VU {%) 
as before, but as we are no longer in a finite setting, we must pay some attention 
to matters of topology. 

We equip 'P{T-L) with the trace distance or Kolmogorov-distance, which is 
the worst case difference in probability on a single event (projection) E, i.e. 

diPi',P4>) = sup(/3^,(£) - P4,{E)) . 

E 

The strong topology on 'PU{'H) is the one defined by pointwise convergence 
on V{H). If we define the strong topology on U{'H) by pointwise convergence 
on v., then the strong topology on VUCH) is the strongest one that makes 
U{H) VU{T-L) continuous. It is with respect to these strong topologies that 
we require our (projective) representations to be continuous. 

If the Hilbert space is infinite dimensional, one can still reduce projective 
unitary representations of G to linear representations of its universal cover G, 
provided that the second Lie algebra cohomology of Lie(G) vanishes. 
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Theorem 2.5 Let G be a connected Lie group such that H^{Lie{G),'M.) = 0, 
and let G be its universal cover. Then every projective unitary representation 
of G comes from a unitary representation ofG. 

Lie algebras with vanishing second cohomology include the real semisimple Lie 
algebras, but also for example the Abelian Lie algebra M, and the (infinite 
dimensional) Lie algebra of vector fields on any smooth manifold of dimension 
bigger than one. 

If /f^(Lie(G),M) is nonzero, then not all is lost. For every projective rep- 
resentation of G, there exists a central extension G of G by the circle S^, and 
a linear unitary representation of G that induces tt. Since the Lie algebra of 
observables then follows Lie(G) rather than Lie(G), a 'central deformation' is 
said to have occurred. 

Conserved Quantities Let us consider a group of transformations as a uni- 
tary representation tt of G, having replaced the group by a bigger one if nec- 
essary. Then the observables corresponding to infinitesimal transformations 
X e Lie(G) are still precisely the derived Lie algebra elements J{X) = i'k{X). 

Recall that tt is strongly continuous by definition, so that Tr^,{g) = 7r(g)'i/' is 
always a continuous map. A vector ^ is called smooth if tt^ is smooth, and we 
denote the Frechet space of smooth vectors by H°°. 

Theorem 2.6 (Stone, Garding) Let it be a unitary representation of a Lie 
group G on a Hilbert space %. Then the smooth vectors %°° lie dense in %. 
For each X G Lie(G), the expression 

tt{X) = exp(-tX)|7r(exp(tX)) 

defines a closed, densely defined, skew -symmetric operator 'k{X). Its domain 
contains H."^ , and on this space, we have 

i,{[X,Y]) = [k{XU{Y)\. 

Proof See [Kna86, ch. Ill], but also [Par92, sec. 12,13]. □ 

In particular, the generator of the 1-parameter group f h- > [/j of time translations 
is again the Hamiltonian H. In realistic models, its spectrum is usually bounded 
from below, but not from above. 

A Point Particle on a Line Let us consider a single particle on the 1- 
dimensional Euclidean space R. We should have a self-adjoint operator X on'H 
to represent its position. The group of Euclidean motions (M, +) should have 
a projective unitary representation tt on %, and its effect on X ought to be be 
a shift, tt{s)^Xtt{s) = X + si. Since if2(]R,]R) = and t = K, we may take 
TT to be linear rather than projective. The observable J(l) = i7r(l) is called 
the momentum, and denoted P. The above relation then differentiates to the 
canonical commutation relation 



[X,P]=il. 



(2.5) 
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One way to realize this is by setting % = i^(]R), {X'tjj){x) = xtjj{x), and 
{n{s)'ip){x) = tp(x — s). The associated momentum P = —ij^ does indeed 
satisfy -P] = il- A theorem of Stone and von Neumann says that this is 
essentially the only way to realize (the integrated version of) equation (2.5). 

The PVM for X is not overly complicated; Ex{S) is given by multiplication 
with the indicator function I5. That is, {Ex{S)^){x) = ls{x)ip{x). This means 
that the probability distribution induced by V on the spectrum M of X is 
just P;^ (S) = \ijj{x)\'^dx. The function jV'P is the probability density of P^ 
w.r.t. the Lebesgue measure. 

The PVM for P can be derived from this. If F is the unitary Fourier trans- 
form on L2(IR), then one has P = F^-x)F, so that Ep{S) = F''Ex{-S)F. 
This means that P^('S') = Jg \Ft{j\'^dx, and the squared Fourier transform of ip 
yields the probability density of P^. 

In sections 2.3.3 and 2.3.4, we will find two other ways to describe the point 
particle on a line. 

A Point Particle in M.'^ Let us consider a single particle in the 3-dimcnsional 
Euclidean space M^. We require 3 commuting self- adjoint operators Xi, X2 and 
X3 to represent its position. The Euclidean motion group x 0(3) should 
have a projective unitary representation tt on "H such that each w G M'^ shifts Xi 
to Xi + Vil, i.e. Tr{vy Xi'K{v) — Xi + Vil, and each rotation R e 0(3) rotates 
Xi to Ei=i RijXj, i.e. TT{RyXiTT{R) = E'=i ^u^i- 

The Lie algebra >^ so (3) of the Euclidean motion group has zero second 
cohomology, so tt induces a linear representation tt on the level of Lie algebras. 
The standard basis ex,ey,ez for and Tx,Ty,Tz for so (3) gives rise to the 
momenta Pfc := J(efe) and the angular momenta L)t := J{Tk)- The commutation 
relations between the Pk and L; arc then easily obtained from the Lie bracket 
on R? X so (3), and their commutator with the Xj~ follows from the requirement 
that the 77(5) shift or rotate the X^. This yields the following extension of the 
canonical commutation relations: 

[Xk,Xi]=Q, [Xk,Pi]=i5kil, [Pfc,P;]=0 (2.6) 

[Xk, Li] = ieklmXm , [Lk, Li] = iCklmLm , [Lk, Pi] = if-klmPm ■ 

This can be achieved by taking the Hilbert space % = i^(R^), with the 
observables {Xkij){x) = Xkij^{x) and the unitary representation {■jT{g)tl)){x) = 
ip{g~^x). The corresponding observables 

Pk = -i^ — , Lk = -I 2_^eklmXlT. — 

can easily be seen to obey 2.6. 

Note that we have established all of this without any reference to the dy- 
namics whatsoever. In other words, everything up to this point is indepen- 
dent of the particular Hamiltonian. A Hamiltonian usually takes the shape 
H = -|- V{x), with V the potential function. This gives rise to the 
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Schrodinger equation 

k=l 

It is the choice of Hamiltonian that determines which transformations axe 
symmetries. For example, the hydrogen atom is described by the potential 
V{x) = —l/\/x1 + x| + x|. In this case, the group of Euchdean motions 

x: 0(3) is not a group of symmetries, but its subgroup 0(3) of orthogo- 
nal transformations is. This means that the angular momenta Lk are conserved 
quantities, whereas the momenta Pk are not. 

2.2 Quantum Probability Theory 

Quantum probability theory (QPT) is a mathematical framework which gen- 
eralizes both quantum mechanics and classical probability theory. It is ideally 
suited to describe open quantum systems, and in particular quantum measure- 
ment. 

We first describe quantum probability theory on a finite dimensional Hilbert 
space H. This allows us to focus on the algebra, postponing the functional 
analysis involved in the infinite dimensional case to later. 

2.2.1 QPT on Finite Dimensional Hilbert Spaces. 

A subalgebra A of B{'H) is called a *-aIgebra if it is closed under the adjoint. 
In quantum probability theory, a system is modelled by a *-algebra A C B{H). 
The observables of the system are described by self-adjoint operators A in A, 
and the states of the system are modelled as follows. 

Definition 2.3 Let % he a finite dimensional Hilbert space, and A a *-suh- 
algebra of B{H). Then a state on A is a positive, normalized, linear functional 
p: A^ C The space of states is denoted S{A). 

One should think of a state as a map that assigns to each observable A its 
expectation p{A). Since the expectation should be M-linear on the observables, 
we may as well extend it to a C-Iinear map on all of A. Positivity means that 
^ > implies p{A) > 0. This is reasonable; if A has nonnegative spectrum, 
then its expectation should also be nonnegative. Normalization, p(l) = 1, 
means that the expectation of 1 is 1. 

Quantum Mechanics 

We have already seen that in our garden- variety quantum mechanics, every self- 
adjoint operator corresponds to an observable, so that A ~ B{H). Any unit 
vector tp eH gives rise to a pure state p^ {A) = {tp \ Aip) , but it turns out that 
not all states are pure. 
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Theorem 2.7 IfK is finite dimensional, then any state p : BCH) C can be 
written 

piA) = tr{RA) 

for a unique positive operator R with tr{R) = 1, called the density matrix. 

Proof. Because {A, B) tY{AB) is a perfect pairing, any linear functional 
p : B{T-L) — > C can be uniquely written as p{A) = tr(i?A). HE — is the 

projection onto a 1-dimensional subspace C^, then p{E) = {ip \ Rtp). Positivity 
of p therefore impUes R>0, and normalization corresponds to tr{R) = 1. □ 

The pure states p^ correspond to 1-dimensional projections R = |V')(V'I- They 
are the extreme points of the convex state space S{B{H)). 

Indeed, if we choose an orthonormal basis tpi of eigenvectors of R, then we 
can write R = ^^^iPi\4>i){4>i\- Positivity implies Pi > 0, and normalization 
implies ^^f^^Pi = 1- Any state on B{H) can therefore be written as a convex 
combination of vector states p = ^^PiPiPi, albeit not necessarily in a unique 
fashion. 

This invites us to interpret a system in state p as one that is with probability 
Pi in state p^^ . We will do so, and even elevate this to the more general stochastic 
equivalence principle, which holds for convex linear combinations of any two 
states pi and p2 on any system A. 

A system in state ppi + (1 — p)p2 cannot be distinguished from one that is in 
state pi with probability p, and in state pi with probability 1 — p. 

Classical Probability Space 

Let (fi,P) be a finite probability space. A random variable is a map / : f2 — )• C. 

It makes sense to identify two random variables / and /' if they are equal almost 
surely, P([/ ^ /']) = 0, and we write L°°(f2,P) for the resulting algebra. The 
expectation defines a linear map E : Z/°°(f2,P) — >■ C. 

The Hilbert space P) is the same vector space, equipped with the inner 

product ((/) I V') = E(^'(/'). Because P) acts on L^(ri, P) by multiplication, 

we can consider it as a commutative *-subalgebra of S(L^(0,P)). This places 
us in the framework of quantum probability theory, with A = L°°(0,P), T-L = 
L-^(ri,P), and p = E. Let us describe the state space S{A). 

Theorem 2.8 Any finite probability space (r2,P) gives rise to a finite dimen- 
sional Hibert space L^in,V), and a *-subalgebra L°°{n,V) of B(L^{n,V)). IfF' 
is absolutely continuous w.r.t. P, then it defines a state E' : L°°{Q,F) C. All 
states arise in this way. 

Proof The set {d^ ; to e Q, V{lo) ^ 0} constitutes a basis of L^{n,¥), so that 
any state p can be written p = X^pL'^cI- Positivity and normalization amount 
to p'_^. > and = 1 respectively. The numbers p'^ thus define a probability 

measure P' P, and p = E' is its associated expectation. □ 
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Operations 

We wish to describe transformations of a system A C B^H) to a system B C 

B{H'). In the context of quantum probabihty theory, these are called operations. 
Each operation is determined by a map r : S{A) S{B). We investigate which 
maps can reasonably be interpreted as operations. 

We interpret the convex combination ppi + (1 — p)p2 G S{A) as a system 
that is in state pi with probability p, and in state p2 with probability (1 — p). 
This means that after the operation, the system B must be described as being 
in state t(pi) with probability p, and in state r(p2) with probability 1 —p. Thus 

t{ppi + (1 - p)p2) = Pt{pi) + (1 - p)t(p2) • 

This means that r extends to a linear map t : A* B* . We can consider it as 
the dual r = T* of a linear map T : B A, and will do so from now on. The 
requirement that T* map states to states implies that T{B) > if B > (it 
preserves positivity), and that T(l) = 1 (normalization). 

Definition 2.4 Let A and B be *-subalgebras of BlU) and B{'H') respectively. 
A map T : B ^ A is called positive if it is linear, if B > Q implies T{B) > 0, 
and ifT{l) = 1. 

One would be tempted to interpret each positive map as an operation, but this 
is not what one should do. There is one more requirement that we must impose 
on T in order for it to qualify as an operation. 

If T : B ^ A describes an operation from A to B, then the map T id„ : 
B (g) S(C") A(E) Z?(C") describes the act of transforming A to B, while doing 
absolutely nothing on the system B{C^). Surely this should map states on 
A (S) B(C") to states on B (g) B(C"), and we therefore require that T (g) id„ be a 
positive map for all n £ N. 

Definition 2.5 A map T : B ^ A is called completely positive if 

T (g) id„ : S ® S(C") ^ (g) S(C") 
is positive for all n G N. 

Perhaps surprisingly, this is not automatic. There exist maps which are posi- 
tive, but not completely positive. We shall model operations from ^ to B by 
completely positive maps T : B ^ A. (CP-maps for short.) 

Operations in Classical Probability Let (17, P) and (fl' ,P') be finite prob- 
ability spaces. Then a transition kernel is a positive function P on O' x such 
that J2^^, P(u;',cj)r(w') = 1 for all cj e O. One should think of P{lo' ,lo)V'{lo') 
as the probability that a system in state lo makes a transition to w'. 

An operation T : i°°(ri',P') L°°(f7,P) is related to such a kernel. In- 
deed, if we write Tf{uj) = J2^-^, P{iu' ,uj)f{uj')r {oj'), then positivity of T entails 
P{uj',uj) > 0, and normalization T(l) ~ 1 translates to ^j^, P (oj' ,lj)P' (uj') = 1. 
An operation is precisely an equivalence class of kernels, if we deem two ker- 
nels to be equivalent when their difference is nonzero only on those (w', w) with 
either P(w) = or P'(a;') = 0. 
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Operations from Quantum to Classical Systems A completely positive 
map T : L°°(ri,P) B{H) is determined by its values on 5^^, i.e. by the map 
E -.n^ B{n) defined as E{uj) = T{5^). 

Definition 2.6 A positive operator valued measure or POVM is a map E : 
n — )• that satisfies 

- E{uj) > for allujefl 

The map E{oj) = T{6u) is a POVM; positivity of T implies the first require- 
ment, and normalization the second. Conversely, any POVM that is measur- 
able w.r.t. P determines a CP-map T : L°°(rj,P) B{n). A CP-map is a 
*-homomorphism if and only if its POVM is projection valued. 

Operations between Quantum Systems Any *-homomorphism of alge- 
bras is a CP-map. In particular, the transformations au{A) = W AU of B{H) 
are operations in the sense of quantum probability. 

Another type of operation is considering a subsystem as part of a bigger 
whole. If 'H is a Hilbert subspace of /C, then it comes with the inclusion V -.l-L 
K., and the adjoint projection : K. ^ n. The CP-map T : B{]C) B{n) 
corresponding to this inclusion is given by T{A) = V"^ AV. 

According to the following classification theorem, any CP-map is a combi- 
nation of a *-homomorphism and an inclusion. 

Theorem 2.9 (Stinespring) Let H and H' be Hilbert spaces of finite dimen- 
sion, let A be a *-subalgebra of B{H'), and let T be a CP-map A B{H). Then 
there exists a finite dimensional Hilbert space IC, an inclusion V : % ^ K and 
a *-homomorphism w : A ^ B{JC) such that T{A) — V'^'k{A)V . 

B{JC) 




Proof. Let ICq = A®T-L. Then the inclusion Vq : "H — > /Cq is given by ?/; i-)- l^ip, 
and if A e .4, then 7ro(A) : /Co — )■ /Co is defined by i? (g) i/' AB (g) ip. We equip 
/Co with the sesquilinear form defined by {B ® ijj \ B' (E) i)') = {^J \ T{B^ B')^)')^- 
Let F = X]i=i ® '^i arbitrary element of Kq. In terms of the map 

T(g)idk:A(g) B(C'=) B{n we can write 

(F|i^) = (*|T(gidfc(s^s)4') , (2.7) 

with * ^ X^^ti tpt ei in n ^ C' and B = ^Jl^ B, ® |ei)(e,| in A(E) S(C'=). 
Because T(E)idk is a positive map, we conclude that {F \F) > 0. Our sesquilinear 
form is positive semidefinite. 
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We can therefore define the Hilbert space JC = JCq/N, with M the kernel 
of (• I •). The map Vq is an isometry w.r.t. (• | •), and therefore induces an 
inclusion V : % ^ JC oi Hilbert spaces. Its adjoint projection is given by 
yt(B0^) =T(B)V'. 

The map ttq : .4 — > /8(/Co) is a contraction. Indeed, if we replace F by 
7ro(A)F in equation (2.7), then on the right hand side, B} changes into the 
expression {A^ A <S) 1)5. Since this is dominated by we have 

{Tro{A)F\Tro{A)F) < \\A\\'^{F\F), and the operator norm of 7ro(A) does not 
exceed that of A. 

In particular, Af is invariant under 7ro(A), so that we have a map ir{A) : 
/C — > /C. It is then straightforward to check that tt is a *-homomorphism, and 
that V^Tr{A)V = T{A). □ 

2.2.2 QPT on General Hilbert Spaces 

We define quantum probability theory in the infinite dimensional setting, which 
requires a little bit of functional analysis. 

Von Neumann Algebras 

We have seen that in quantum mechanics, all the structure of a system is en- 
coded in its algebra B{H) of bounded operators. A subalgebra of S('H) is called 
a von Neumann algebra if it is closed not only under the adjoint A A^, but 
also in the weak topology. In quantum probability theory, a system is modelled 
by a von Neumann algebra A. 

A self-adjoint, possibly unbounded operator A on is said to be affiliated to 
A if all its spectral projections are in A. Observables of the system are modelled 
by self-adjoint operators affiliated to A. 

If A and B are von Neumann algebras, then we will denote by A® B the 
von Neumann algebra generated by the algebraic tensor product. 

Normal States 

A state is again described by a normalized positive linear map p : A ^ C, but 
it is wise to impose a continuity condition. If A is affiliated to A, then we would 
like its spectral measure E : Spec(^) ^ ^ to induce a probability measure 
Fp = poE on Spec(A). But although E is cr-additive, i.e. Y.^E{Si) = E{\J^Si) 
in the weak topology, this need not hold for fp=poE, unless we impose the 
following continuity requirement. 

Definition 2.7 A state p : A ^ C is called normal if it is weakly continuous 
on the unit ball {AG A; \\A\\ < 1}. 

A normal state p € S{A) induces a probability measure P^, on the spectrum 
of any self-adjoint operator affiliated to A, so that we have our beloved map 
(p, A) !->• Pp. With respect to Pp, we have of course E(A) — p{A). 
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Normal States in Quantum Mechanics The normal states of an ordinary 
quantum system B(^) can again be described by a density matrix. 

Theorem 2.10 Let V. be a separable Hilbert space. Then any normal state p 
on B{T-L) can be written 

p{A) = tT{RA) 

for a unique positive trace-class operator R onH with tr{R) = 1. 

Proof. See for example chapter 7 of [KR86]. □ 

A positive normalized trace-class operator admits a basis of eigenvectors with 
nonnegative eigenvalues pi summing to 1. In other words, any state can be writ- 
ten as a countably infinite convex combination of pure states, p = Y^'j^iPiPiPi- 

Normal States in Classical Probability A classical probability space, de- 
noted (f2,S,P), gives rise to the algebra S,P) of essentially bounded 
random variables on O. i.e. bounded measurable functions up to the equiva- 
lence ~, with / ~ g if / = ^ almost surely. 

Since L°°{Sl,T,,F) acts on the Hilbert space L^(0,S,P) by multiplication, 
we can regard it as a commutative *-subalgebra of S,P)), which turns 

out to be weakly closed. 

Theorem 2.11 The algebra E,P) is a commutative von Neumann sub- 

algebra o/ ,B(L^(ri, S, P)). // P' is absolutely continuous w.r.t. P, then the 
expectation E'(/) = f (dcj) constitutes a normal state on L°°(f2, E,P), 
and every normal state arises in this way. 

Proof. See e.g. [Maa03] and the references therein. □ 

In short, a classical probability space (O, S, P) gives rise to a commutative von 

Neumann algebra, and a normal state corresponds to a probability measure 
P' <C P. Essentially every^ commutative von Neumann algebra arises in this 
way. 

Operations 

The definition of a completely positive map remains unaltered. A linear map 
T : B ^ A between von Neumann algebras is called positive if T(l) = 1 
and if _B > implies T{B) > 0. It is called completely positive if T (g) i(l„ : 
B ® e(C") A(g) e(C") is positive for all n € N. This already imphes that 
T (g) id : B (g) B('H) — >■ ^ (g) S('H) is positive for arbitrary y., so that we need not 
add this as an extra requirement. 

We do however require T* to map normal states to normal states. We 
therefore model operations by weakly continuous CP-maps. 

^To be more precise, a state p is called faithful if A > and p{A) = implies A = 0. 
Any commutative von Neumann algebra A with a faithful normal state p is isomorphic to 
L°°(r2, S,P), and the state p corresponds to the expectation w.r.t. P. 
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Operations in Classical Probability Let (0,S,P) and (fi',E',P') be two 

probability spaces. A transition kernel is a positive measurable function P on 
n' X n such that J^^, P{uj' ,u})¥' {du') = 1 for all w G We will identify two 
transition kernels if they are equal almost surely w.r.t. P' x P. One should think 
of P{uj' ,uj)F'{dijj') as the probability that the system ends up in du' , provided 
that it starts in w. 

Completely positive maps between classical probability spaces correspond 
precisely to transition kernels. 

Theorem 2.12 A transition kernel P gives rise to the weakly continuous CP- 
map Tp : L°°(f7',S',r) ^ L°°(ri,S,P) byTp{f){uj) = J^, f{uj')p{uj,uj')r {duj). 
Every weakly continuous CP-map T : L°°(n', S',r) L°°(f2,S,P) is of this 
form. 

Proof. We prove the second statement. The map T gives rise to the normal 

state p on S',r) S, P) defined by p{f ® f) = Er{fT{f')). Ac- 

cording to theorem 2.11, this corresponds to a probability measure on fi' x f2 
which is absolutely continuous w.r.t. P' x P. Its Radon-Nikodym derivative, 
well defined up to equivalence, is a transition kernel P. The requirement 
J^, P{oj' ,oj)F'{duj') = 1 corresponds to the fact that P is the marginal prob- 
ability distribution on ft, as is clear from p(l (g) /) = Ep(/). □ 

Operations from Quantum to Classical Systems We have seen that a 
projection valued measure E : T, ^ B{H) gives rise to a *-algebra homomor- 
phism Ie : L°°{Q, S) B{n), by way of 

Mf) ■■= [ f{oj)E{dLo) . (2.8) 
Jn 

If ^ P, this factors through a weakly continuous homomorphism Ie ■ 
S,P) B{T-L) of von Neumann algebras. Conversely, any weakly contin- 
uous homomorphism T : L^'{il, S,P) — t- B{H) of von Neumann algebras gives 
rise to a PVM by way of E{S) = T{ls). 

In order to capture all CP-maps, not just the homomorphisms, we introduce 
the notion of a Positive Operator Valued Measure or POVM (cf. def. 2.6). 

Definition 2.8 Let {fl, S) be a measurable space, and T-L a Hilbert space. Then 
a POVM is a map E -.T, ^ with the following properties. 

- E{S) > for all 5 e S. 

- E{n) = 1. 

- If Si is a sequence of disjoint sets in S, then J2iLi ^i^i) = E{\J°^^ Si) 
in the weak topology. 

A weakly continuous CP-map T : L'^'{fl, S, F) Bin) corresponds to a POVM 
£; < P by T{f) = f{x')E{d,Lj). The POVM is a PVM (cf. def. 2.2) if and 
only if T is a homomorphism. 
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Operations in Qucuitum Systems Although every homomorphism of von 

Neumann algebras is a CP-map, it is certainly not true that every CP-map is a 
homomorphism. We do however have the following theorem. 

Theorem 2.13 Every CP-map T : B ^ A with a completely positive inverse 
is an isomorphism. 

See page 76 for the proof. The group of invertible operations of a system 
A is thus simply Aut'^*(^), the group of weakly continuous automorphisms. 
Its action on the state space «S(^) equips it with the topology of pointwise 
convergence in trace distance. (See page 12.) From the point of view of quantum 
probability theory the action of a Lie group G should therefore be described by 
a continuous homomorphism tt : G — )• Aut'^*'{A). The following theorem shows 
that Aut'=*(^CH)) = PU{H). 

Theorem 2.14 (Wigner) Let H be a separable Hilbert space. Then every 
weakly continuous automorphism a ofB(Ji) is inner, and can be written a{A) = 
WAU for some unitary U. 

Proof. Because a* maps normal states to normal states, we can consider it as a 
continuous trace-preserving C-linear map on the trace-class operators T{H), cf. 
theorem 2.10. Because a* maps R to a~^{R), it restricts to a map on the pure 
state space 'P{'H). and a. is determined by this restriction because of continuity. 
The map on V{H) preserves the pairing (C^,C^/^) = K*/) | V") 1/1101111^11 because it 
preserves the trace. According to Wigner's classification of the automorphisms 
of V{'H), any such map must be given by a*{\tlj) {tl)]) = U\^/j) {t/j\U^ with U either 
unitary or anti-unitary. The map U ■ W can only be extended to a C-linear 
map on T('H) if U is unitary, or of course if H is one-dimensional, in which case 
the theorem is trivially true. □ 

An action of G by weakly continuous CP-maps on B{T-L) is therefore precisely a 
projective unitary representation. In particular, invertible time evolution on a 
closed quantum system is still described by a Hamiltonian, and we do not gain 
anything new. 

Non-invertible time evolution on an open quantum system however should 
be described by a continuous homomorphism of the semigroup (M^o,+) into 
the semigroup of weakly continuoTis CP-maps on B{'H). which will in general 
not be automorphic. CP-maps are classified by the following theorem. 

Theorem 2.15 (Stinespring) Let Abe a von Neumann algebra, H a Hilbert 
space, and let T be a CP-map A — )■ B{T-L). Then there exists a Hilbert space 
K, an inclusion V : % ^ K and a * -homomorphism tt : ^ — > B{K) such that 



If we impose that IC be the closure of 'jt{A)'H, then (tt, y,/C) is unique up to 
unitary transformation, and T is weakly continuous if and only if tt is. 



T{A) = V'<7t{A)V. 



B{JC) 



TT 




Proof. See Stinespring's paper [Sti55]. 



□ 
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Summarizing 

We arrive at the following picture of quantum probability theory. A system is 
modelled by a von Neumann algebra A of bounded operators on a Hilbert space 
H. An observable is modelled by a self-adjoint operator affiliated to A. A state 
on this system is described by a normalized positive linear map p : A C that 
is weakly continuous on the unit ball, and an operation from ^ to B is described 
by a weakly continuous completely positive map T : B A. 

2.3 Quantum Stochastic Differential Equations 

This concludes our description of quantum probability spaces in general. We 
will now focus on a single model that will be of particular relevance to this thesis: 
the second quantized electromagnetic field. Its Hilbert space J^(I/^(M"'")) is the 
Fock space of L^iR+). 

2.3.1 Fock Space 

If "H is a Hilbert space, then so is its n-fold symmetric tensor power 'H®^". We 

set n'^-° := C, and define the Fock space over "H as follows. 

Definition 2.9 The (symmetric) Fock space ^-"('H) over a Hilbert space H is 
defined as 

oo 
n=0 

the Hilbert space of sequences n i->- w„ with w„ e 'H®"" and X]n(^n I ^n) < oo. 

In many respects, taking H into J^{H) can be seen as exponentiating a Hilbert 
space. For example, the set of exponential vectors {e{v) ; v e %}, with 

oo _j 

n=0 v'T'- 

is linearly independent and dense in F{V.) (see [Par92]). They are easily seen 
to satisfy 

(e(w) I e{w))jr(^) = cxp((w | w)h). 
We will denote the vectors w*^" by \v)n- The normalized exponential vectors 
are called coherent states. We denote them by \v) := e~2^'"\'")e{v). 

Proposition 2.16 There is a natural isomorphism of Hilbert spaces 
under which e{vi ® V2) — e{vi) 6(^2). 

Proof. Since the exponential vectors are dense and linearly independent, the 
expression e(vi ® ^2) 1— )■ e{vi) (g) e(w2) yields a densely defined linear map. 
Since both {e{vi V2) \ e{u!i © W2)) and (e(t'i) ® ('{V2) \ (8) e{w2)) equal 

exp((t;i I wi) + {v2 I W2)), it extends to a unitary isomorphism. □ 
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By the same token, it is clear that an isometry U : Hi ^ ^.2 induces the 
isometry F{U) : F{T-Li) — > T{7i2) defined by F{U)e{v) — e{Uv). Because 
J^{U' o U) = -FiU') o J^{U), one can view as a functor from the category of 
Hilbert spaces to itself, taking direct sums into tensor products^. In particular, 
Tiji) carries a unitary representation of Ui^). 

2.3.2 Weyl Operators, Fields and Momenta 

The unitary representation U ^ J^{U) of J7('H) extends to a projective unitary 
representation of the group U {H) with product {U, v){U', v') = {UV, Uv'+ v). 

Proposition 2.17 The Weyl operators W{U,v), defined by 

W{U,v)e{w) = eicp{-l\\vf - {v \ Uw))e{Uw + v) , (2.9) 
constitute a projective unitary representation of Uiji) x %, satisfying 

W{U, v)W{U',v') = exp(-am(w | Uv'))W{UU', Uv' + v) . (2.10) 



Proof. Since {W{U,v)ew\W{U,v)e{iu')) = {ew\ew'), equation (2.9) does in- 
deed define a unitary operator. Equation (2.10) is then readily verified on the 
dense set of exponential vectors. Continuity essentially follows from continuity 

of the map v i— > e{v), see [Par92] for details. □ 

We will denote W{1, v) by W{v), and designate W{U, 0) by its old name J^{U). 
If is finite dimensional, we find a projective Lie algebra homomorphism W : 
su{H) K "H — > su{T{'H)), cf. section 2.1.2. Rather than making this precise for 
infinite dimensional H, we will restrict attention to Lie subgroups of U {H) K H 
of dimension 1 or 2. 



Associated Observables 

Every self-adjoint operator H gives rise to a one-parameter Lie group in U{'H) k 
"H, as does every vector v & H. The restriction of W to these one-parameter 
groups is a linear unitary representation. We can therefore define the skew- 
symmetric closed operators 

jr(fl-) = jr(e'*^) j^H^~"*") (2-11) 

and 

Wiv) = W(-tv)^W(tv) . (2.12) 
at 

The observable X{H) := i T{H) is called the difi'erential second quantization of 
H. We will not be needing commutation relations involving it. (See however 
[Par92].) The operators J{v) = iW{v) are called fields and momenta. 



^In the words of E. Nelson: 'Quantization is a mystery, but second quantization is a 

functor.' 
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They satisfy the canonical commutation relation 

[J{u), J{v)] = 2ilm{u I , (2.13) 

as can be seen by differentiating (2.10). Since every exponential vector is in the 
domain of every product J{vi) . . . J{Vn), the l.h.s. of equation (2.13) is densely 
defined. 



Fields cind Momenta 

As is the case in Haniiltonian mechanics, the division of the J{v) into fields 
and momenta is to a certain extent arbitrary. If we choose a completely real 
subspace Hu, then we define for / e Hr the fields to be 

and their conjugate momenta 

n(/) = -j{if) . 

They satisfy the commutation relations 

[$(/),$(5)] = o, mf),u{9)] = Mf\9), [n(/),n(5)] = o. 



Creation Etnd Annihilation Operators 

Even without a completely real subspace, we can introduce for any u G H the 
creation and annihilation operators 

at(M) = -i(J(m) +iJ(u)), a{u) = -l{J{iu)-iJ{u)), (2.14) 

respectively, satisfying 

[a{u),a{v)]=0, [a{u),a^{v)] = {u\v)l, [a^ {u) , (v)] = . 

By sandwiching (2.9) between exponential vectors and applying equation (2.12), 
we obtain: 

{ev\a{u)e{v')) = {u\v'){ev\ e{v')) , (2.15) 
{ev\a^{u)e{v')) = {v\u){ev\e{v')) . (2.16) 

We will also have use for the second order correlation functions 

{ev\a\ui)a^{u2)e{v')) = {v\ui){v\u2){e{v)\ev') (2.17) 

{ev\a{ui)a{u2)e{v')) = {ui\v'){u2\v'){ev\e{v')) (2.18) 

{ev\a^iui)a{u2)eiv')) = {v\ui){u2\v'){ev\e{v')) (2.19) 

{ev\a{ui)a^{u2)e{v')) = {ui\v'){v\u2){ev\e{v')) (2.20) 

+ {ui\u2){ev\e{v')) . 

The seemingly innocuous hiccup at (2.20), essentially due to the fact that a 
and fl^ do not commute, is what will eventually make the quantum stochastic 
calculus of section 2.3.8 work. 
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2.3.3 The Harmonic Oscillator a la Fock 

It is instructive to see how all of this works out in the simple case J^{C). This 
describes the harmonic oscillator, i.e. a particle moving in a potential V{x) ^ x^. 

The n-fold symmetric tensor product C*^"" of C is one dimensional, and we 
choose the unit basis element 1 1 )„ = 1 (g) . . . (g) 1. We denote it | n ) := 1 1 )„ for 
brevity. These combine to form the 'particle basis { | n ) ; n e N } of the Fock 
space 

oo 

In terms of this particle basis, we have the exponential vectors 
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Conversely, we can recover the particle basis from the exponential vectors by 

1 rf" 

\r^} = —:^.l^^e{z). (2.21) 



Let us investigate what the observables A(/i) = iJ^{h) and J{v) = iW{£,) look 
like in the particle basis. (They combine to form a projective representation of 
the three-dimensional real Lie algebra su(C) k C.) 

The 1-dimensional unitary group U (C) acts on the exponential vectors by 
W{e-^^)e{z) = e{e-'^z). From equation (2.21), we see that W{e-'*^)\n) = 
e-'"*^\n}, so that 

X{h)\n) =nh\n) . (2.22) 

We denote the second quantized Hamiltonian hy H = A(l). The particle basis 
is a basis of eigenvectors of H. 

Using equation (2.21) and the fact that J{v) = i-^\oW{tv), a short calcula- 
tion starting from 

W{tv)e{z) = exp(-it2|^p - tvz)e{z + tv) 

yields 

J{v) I n ) = iv^/n+ 1 | n -I- 1 ) — ivy/n \n — 1) 
for n > 1, and J{v) | 0) = iv\l). From equation (2.14). we then have 

a{l)\n) = Vn \n-l), a^l)\n) + 1 \ n+l) . (2.23) 

From equation 2.13, we see that [J(l), J(i)] = 2i. If we define X = — J(l) 
and P = —J{i)/2, we have the canonical commutation relation [X, P] = i. The 
creation and annihilation operators can be expressed as 

a{l) = P-liX, a^l) = P+liX 

with [a(l),a^(l)] = 1. 
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We see from equations (2.22) and (2.23) that A(l) = a(l)a^(l) — 1, so that 

This means that J^{C) serves as a Hilbert space for the harmonic oscillator 
in which H is diagonal w.r.t. the particle basis. The Schrodinger equation 
i^'ip{t) = H^{t) is therefore trivially solved, the fundamental solutions are 

^P{t) = e-*"*|n). 

The fact that H is diagonal means that X and P are not, because they do 
not commute with H. We will now consider a representation in which X is 
diagonal, but H and P are not. 



2.3.4 The Hcirmonic Oscillator a la Gauft 

Consider the Hilbert space L^(M,Pg), with Fq is the Gaussian probability mea- 
sure 

V 27r 

It has the characteristic function xg{z) = E(e'^^) = e~5^^. Because the func- 
tions ez{x) = e"^ are in Z/^(]R,Pg) not only for 2; e M, but for all 2; e C, we 
may consider xg{z) as a holomorphic function. 

The inner product (e^ | e^^) = E{eA'^^^^^) equals Xg{w — z) = e^if""^^) . 
The linear span of the ez is dense in L-^(M,P(3), and if we define = es^ e^, 
then {Ez \ E^) = e^"". 

Recall that similarly, the exponential vectors e{z) span J^(C), and that 
{e{z)\e(w)) = e^™. The map e{z) ^ E^ therefore yields an isomorphism 
7 : J"(C) 4 L2(M, Pg). We can now transport the Weyl operators to ^^(M, Pg), 
as well as X , P and H . 

On exponential vectors, the Weyl operators are given by ^W{v)^~^Ez{x) = 
e2^("-^) e^^'' Ez{x+v — v). The E^ are dense, so we must have ^W{v)^~^%Ij{x) = 
P2v(v-v)pivx^^^ + w — w) for all tp € L^(M,Pg). The corresponding obscrvables 
arc then obtained from J{v) = i-^lojW {tv)^~^ . We get J(l) = — x and J{i) = 

i(2^ —x), so that X = x, P = —i-^ + ^ix and H = ^^'li- conclude 
that indeed, in the Gaussian picture, X is diagonal. 

Although H is not diagonal here, we can use 7 to transport its basis | n) of 
eigenvectors to L^(K, Pg). The particle basis then transforms into the Hermite 
polynomials 



1 d" 



which apparently satisfy (— ^+a;^)V'n = n-^n- Schrodinger's equation i^V' = 
(~(^ ^'Sx^'^ therefore has the fundamental solutions e~''"'*ipn{x). 



2.3.5 The Wiener Measure 

We wish to obtain a Gaussian representation of the Fock space J^(L^(M+)). 
in which the fields $(/) for all real functions / are simultaneously diagonal- 
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ized. The germane probability distribution turns out to be the so-called Wiener 
measure, obtained from the random walk in the limit of small step size. 

The Random Walk 

The random walk is the following stochastic process. Divide the time line M"*" 
into equal intervals of length Af. At the start of each time interval, flip a coin 
to step either left or right by an amount V At. If we let At tend to zero, then 
according to the central limit theorem, the position at time t will have a 
Gaussian distribution with /x = and a = \ft. 



t 

B 




Fig. 1: Random walk, produced with a 0,05€ random bit generator. 



For each finite step size, the random walk places a probability measure Pa* 
on the set of possible paths starting at 0. The Wiener measure P^y is its 

limit as At tends to zero. It lives on C5(IR"'"), the space of continuous paths 
B : M+ — )■ M that start at and grow at most polynomially. We briefiy sketch 
its construction. 

Measures on the Space of Tempered Distributions 

Because the increments AB^ rather than the endpoints Bt are independent, it 
is natural to view Bt as the 'noise' = ^B^ integrated from to T, and then 
consider Fyy as a probability measure on the set = {N = ^B; B G C5(M+)} 
of noises. 

Unfortunately, not every continuous function is differentiable. Because one 
cannot expect the erratic {ABt ~ V~Ai) paths to be differentiable, we cannot 
solve this problem by restricting the space of paths. We therefore define the 
noise Nt as the tempered distribution 

N{cl>) = - Bt-Ht, 

a continuous linear functional on the space Cg°(M"'") of rapidly decreasing func- 
tions vanishing at 0. Using highly illegal partial integration, we write this 
symbolically as N{(j)) — (f>{t)dBt. 

Having realized our noises as a subset Q, C r'5(]R"'") of the tempered distribu- 
tions, we can (and will) consider the Wiener measure as a measure on 2?5(M+) 
with Pw(f^) = 1. 
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Now for any probability measure P on ■D5(]R+), we define the characteristic 
function x : C's°(M+) C to be 

Xicfy = [ exp{iF{cP))F{dF) . 

JX's(R+) 

According to the following theorem, a probability measure is completely char- 
acterized by its characteristic function. (Hence the name.) 

Theorem 2.18 (Bochner-Minlos) A functional x ■ Cg°(M+) C is the 

characteristic function of a probability measure on the space 2?5(IR"'") of tempered 
distributions if and only if it is continuous, if x(0) = 1 and if it is positive 
semidefinite, i.e. ZiZjx{(f>i — <Pj) > for all zi . . .Zm € C, 0i . . . (f)m S 

C|°(M+). 

Proof. Any characteristic function is continuous, and x(0) = P(^s(]^''')) is 
equal to 1. It is also positive semidefinite, as 

V ZiZj [ exp{iF{(l>i - ct)j))F{dF) = [ | V exp(jF((/)i)) | 

is certainly nonnegative. For the converse statement, which is of course the 
hard part, see e.g. [GV64, p. 322]. □ 

Our strategy is now to calculate the characteristic function XAt of the random 
walk process with step size At, and show that it tends to a limit xw as At 
tends to zero. We then use theorem 2.18 to define Pvy as the unique measure 
characterized by xh^- 

The Wiener Mecisure 

Calculating XAt is rather straightforward. If we think of w G {±1}^^ as an an infi- 
nite coin toss, then Bt{oj) = Ns{oj)ds, with N{ui) = J2kLo 7^1[feAt,(fe-M)At) • 
As a distribution, this reads 

k=0 VAtJkAt 

Then 

XaM) = lim 2- E ^ / '^(^)^^ 

/ ^ Ak+l)At \ 
= lim 11 cos , / 6{s)ds 1 . 

This means that as the step size At tends to zero, the characteristic function of 

the random walk process approaches the limit XAt(</') 6xp(— ^ (j)'^{s)ds). 
The fimction Xw{4>) = <^xp(— ^ f^^ (t>^{s)ds) is indeed continuous and positive 
semidefinite, so that we may finally define the Wiener measure. 
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Definition 2.10 The Wiener measure ¥w is the unique probability measure 
Fw on Ps(K+) with characteristic function Xw{(l>) = exp(— g (lP{s)ds). 

One can show that the Wiener measure concentrates on the set of noises, 
Fiy{Q) = 1. It therefore induces a probabihty measure on Cs(M+), which 
we denote again by Fw It is the unique one with the following properties. 

- Every increment Bt^ — Bt^ has a Gaussian distribution with = and 

= ^/t2 — t\. 

- The increments are independent, i.e. for any sequence ti < ... < ffe of 
times, the steps Bf.^-^ — Bt. are mutually independent. 

- The increments are stationary, i.e. the joint probability distribution of the 
steps {Bti — Bsi), . . . , (-Btj, — Bg^) does not change if all ti and Sj are 
shifted by the same amount. 

2.3.6 Stochastic Differential Equations 

Consider Bg as a random variable on the space il C I?5(M^') of noises. Let 
be the u-algebra generated by {Bg ; < s < t}. The cr-algebras with s € R"*" 
filter n, in the sense that Et is finer than if t > s. 

A process is a measurable map Y : fl x M+ — > M. A process is called 
adapted if Yf is measurable w.r.t. Et. One should think of an adapted process 
as one in which Yt depends only on the paths Bg up to time t. We denote by 
L'^{fl X 1R+, St) the Hilbert space of adapted processes that are square integrable 
w.r.t. the product of Pvi- and the Lebesgue measure. 

An adapted step function is an adapted process of the form 

oo 

Yt{w) = J2ck{oj)l[t„t,+,){t) 

fe=0 

for some increasing sequence tk with to = and tk oo. Adaptedness implies 
that the Ck are measurable w.r.t. S^j.. If Y is an adapted step function, and 
T = tn, then we define the stochastic integral 

r Y,dB, = |j c,(u;)(Bt,^, (c.) - B^, (u;)) . 

■^0 fe=0 

If T is not one of the tk, we simply make a subdivision to fix this. 

Theorem 2.19 (Ito) The adapted step functions are dense in L^(0 x M+, Sj), 
and the map Y ^ Y^dBg on step functions extends to an isometry I : 
L'^{n X E+,St) L'^\n,Fw)- We denote /(Flfo,*]) by J^YsdBg. The process 
Zt = JgYgdBs has a version with 1 1— >■ Ztito) continuous a.s. 

Now that we know how to integrate adapted processes, we can define the stochas- 
tic differential equation 



dXt ^ A{Xt,t)dt + S{Xt,t)dBt 



(2.24) 
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to be just shorthand for the integral equation 

Xt = Xo+ I A{Xt,t)dt + / S{Xt, t)dBt . 
Jo Jo 

If A and 5 both satisfy \ f{x,t)\ <C(l + |a;|) and \ f{x,t)-f{x',t)\ <D\x-x'\ 
uniformly for t € [0, T], then a Picard iteration scheme shows that equation 
(2.24) has a unique continuous adapted solution Xt for any given Xq. 

Theorem 2.20 IfXt and X^ satisfy dXt = A{Xt,t)dt+S{Xt,t)dBt anddX^. = 
A'{Xi,t)dt + S'{X[,t)dBt, then XtX[ satisfies d{XtX[) = XtdX[ + dXtXt + 
dXfdX'f, where dXfdX^ should be evaluated according to the following ltd table. 



X 


dt 


dBt 


dt 








dBt 





dt 



In other words, 

d{XtX't) = {XtA't + AtX't + StS't) dt + {XtS't + SX't) dBt . 

This is what ultimately makes stochastic differential equations tractable. We 
can just do calculus, as long as wc keep track of the second order terms. 
For example, let us solve the stochastic differential equation 



dXt = XtdBt (2.25) 

with initial condition Xo = 1. One would expect exponential behaviour. For any 
function /, we have df{Xt) = f'{Xt)dX, + \f" (Xt){dXtf . This means that 
exp(i3t) will not quite be a solution; dexp(_B/) = exp{Bt)dBt + ^ exp{Bt)dt. 
We correct this, and try Xi = cxp{Bt — ^t) instead. This does the trick; 
dexp{Bt-^t) = {dBt-\dt)exp{Bt-\t) + \exp{Bt-\t)dt = exp{Bt-\t)dBt, 
so that Xt = exp{Bt — ^t) is the unique solution of equation (2.25) with Xq = 1. 

2.3.7 Fock Space and the Wiener Meaisure 

We can now realize the fields $(/) as multiplication operators on the Hilbert 
space L^(C5(M+),Pvi/). The construction of the required isomorphism 7 : 
J^(L^(M+)) i^(Cs(M+),Pi^) is analogous to that of the homonymous map 
of section 2.3.4. 

The characteristic function xw ■ CgP(M"'") — )■ C of the Wiener measure 
extends to a holomorphic map xw '■ Cg'{M.~^ ,C) — )• C, given by the same formula 
Xwi'f) = cxp(— i JJ^ 0^(s)(is). The functions : F 1-^ exp(iF((/))) therefore 
have finite expectation xw(0) = ^(e^) for all 4> G C|P(E+,C). They are even 
square integrable. Indeed, 

(e^|e^) = / exp{-iF{^))exp{iF{ip))¥w{dF) 

Jvs{R+) 

= [ exp{iF{,p - 4>)) , 

JVs(K.+ ) 
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so that (e^ | e^) = Xh^(V' — 4')-: and in particular (e^ | e^) < oo. Moreover, in 
terms of the renormalized functions 

E^F) = exp (^l cf,^t)dt^ exp{iF{cji)) , (2.26) 

we have the coveted relation (i?^ | E^) = exp((0 | Since the E^ are dense 
in i^(I>5(]R+),Pv^A), the map e(^) i-)- extends to an isomorphism. It is this 
map. considered as an isomorphism 7 : F{L'^{M+)) ^ L'^ {C gi^'^) ,^w) , that 
realizes the fields $(/) as multiplication operators. 

Theorem 2.21 The isomorphism 7 : J"(L2(M+)) ^ L2(Cs(M+),Pw) sends 
e((/)) to 

7e(0) : B exp (j>{t fd?j exp (^i (j){t)dBt^ (2.27) 

/or a?; e L2(K+,c), and $(/) to 

7$(/)7-' : L{B) ^ (^^" /rfSt) L(B) (2.28) 

for all f G L^(R+,K). Moreover, the probability measure induced on Cs{^'^) 
by the vacuum state e(0) is precisely the Wiener measure Fw 

Proof. If we consider the Wiener measure to be based on Cs (I^^ ) , then equation 
(2.26) and the definition of 7 yield (2.27). Using equation (2.9), one calculates 
that 

-fW{v)-f-^E^{B) = exp ^i v{dB + l\n{v)ds)^ E^{Bt + 2^ Im(t>)ds) . 

Because the E^ are dense, this continues to hold if one replaces E^ by any 
L e L'^{Cs{M.+ ),Vw)- Differentiation then yields^ equation (2.28). 

The vacuum state e(0) induces a probability measure P(/i,... j„) on the joint 
spectrum M" of (<l>(/i), . . . , $(/„)). We show that this coincides with the joint 
probability distribution P|^^ ^ ^ of (/p°° fi{t)dBt, . . . , /q°° fn{t)dBt) under the 
Wiener measure. This suffices to prove the theorem, because the cylinder sets 
{F ; (-F(/i), . . . i^(/„)) e S} with open S' c M" generate the Borel a-algebra of 
Vs{R+). 

The characteristic function of P(jj ...j^) is 

n 

X{fi,---,fn;h,---kn) = (e(0) I H exp(i$(fc,/,))e(0)) . 

r=l 

Because exp(i$(fc/)) = W{kf), and Yl^W{kifi) = W{J2ikifi), we see that 
x(/i, ...,/„; fci, ... fc„) = x(E^ hf^) with x(/) = (e(0) | W{f)e{0)). It equals 
X(/) = exp(-i(/ I /)) by equation (2.9). 

'^Thc conjugate momenta are given by n(/)L(B) = fdBt — 2ig-j-^^ ^{B) , but we 

will not need them here. 
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Similarly, ^^-^ has characteristic function 

[]cxp(iA-, / fr{t)dBt)¥w{dB) . 

This is equal to xw{J2i ^ifi)^ with xw{f) = J cxp{i f{t)dBt)fw{dB). We 
have xw{f) = exp(— by definition 2.10. The probability measures 
^(/i> -,/n) ^'^'i ^(/i /„) '^ow agree because their characteristic functions do. □ 

2.3.8 Quantum Stochastic Differential Equations 

We have defined stochastic calculus in terms of the random variables B(. Ac- 
cording to theorem (2.21), they correspond to the field operators $(l[o,t]) = 
i(a(l[o,t]) — a^(l[o,«])) through the isomorphism 7. 

However, we may as well choose to diagonalize^ the conjugate momenta 

n(l[o_|r]) = a(l[o./]) + a^(l[o,(]), and integrate w.r.t. those instead. We then 
have two noncommuting notions of stochastic integration living on the same 
Fock space. 

This train of thought is broTight to its logical conclusion by the theory of 
quantum stochastic integration, due to Hudson and Parthasarathy. 

Adapted Processes 

We take U = L^{R+), and write H = Ht ® H* with Ht = L^{[0,t)) and 
•H* = L2([t,oo)). If we write T = F{n), Ft = F{Ht) and = then 
this translates into = (g) J-"* . 

We take 5 to be the dense set of locally bounded functions in 'H. The set 
e{S) of exponential vectors then spans F . 

We choose an auxiliary Hilbert space /C. One may think of as the Hilbert 
space of the electromagnetic field, and of /C as the Hilbert space of some system 
(an atom, say) in interaction with this field. Our dynamics will take place in 
K® the Hilbert space of the combined system. 

The von Neumann algebra M = B{1C ® F) of observables possesses the 
natural filtration {Mt ; t E M^}, with Ml = B(T-L ® Fi ) ® Ij^t the von Neumann 
algebra of observables up to time t. We denote its commutant by A/"*. It is the 
algebra of future events, = IjciSiJ^t 

Definition 2.11 An adapted process based on S is a family {Ft ; t € M+} 0/ 
operators such that Ft is affiliated to N't, and )C®e{S) is in the domain of each 
member. 

The operator Ft does not 'know' what happens after time t. The common core 
is a convenient and not overly restrictive technical requirement. 



''This is easily done by composing 7 : T{L'^{K.+ )) ^ L^(Cs(IR+), Pw) with the 'Fourier 
transform' J^{i) e U{T{L'^{R+))). 
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Definition 2.12 An adapted process Ft is called simple if there is an increasing 
sequence tk with to = and ife — > oo such that Ft = 'Ylk=o -^i''^ltk,tk+i)i^)- 
It is called continuous if the map 7 : t i— >■ FtU (Ei e(/) is continuous for all 
u G IC, f & S, and it is called locally square integrable if 7 is measurable, and 
lo 117(^)11^'^* finite for each t e M+. 

The locally square integrable processes arc sTifEciently general to cater to our 
needs, and the simple ones are, as the name suggests, easy to handle. Luckily, 
locally square integrable processes can be approximated by simple ones. 

Proposition 2.22 (Muthuramalingam, [HP84]) Any locally square integra- 
ble process Ft can be approximated by a sequence of simple processes F^, in the 
sense that for each t £ M+, u G IC and f & S, we have 

lim / \\{F,-F^)u(^e{f)fds = 0. 

Proof. One first shows that the simple processes approximate continuous ones. 
From this, one sees that it suffices to approximate Ft by the convolution pro- 
cesses Ff = (F * fcl[o^i/fc])(i), which are continuous. See [HP84] for details. □ 

Quantum Stochastic Integration 

Stochastic integration is integration against the distinguished Wiener process 
Bf In the noncommutative version, there are two separate processes which 
take its role: the annihilation process A{t) = a(l[o t]) and the creation process 
^t(f) = at(l[o,,]). 

Definition 2.13 Let Ft = ^'^Lo ^k^[tk,tk+i)i'^) ^ simple process, and let 
T — tn- (Refine the sequence tk by inserting T if necessary.) We then define 
the stochastic integral 

/ FtdAt = Fk{A{tk+i) - A{tk)) . 

The expression J^FtdAl is defined in a similar fashion, with ^ in stead of A, 
and of course we have Ftdt = J2k=o Pk{tk+i — tk) as usual. 

Note that Fk is affiliated to J\ft^ , acting essentially on K, (E) Tt^, ■ On the other 
hand, A{tk+i) — A(tk) = a(l[/,^,jj.^j]) is affiliated to its commutant A/^',,, acting 
essentially on T*'^ . In particular. commutes with A^^^. and AAJ^. We have 
used (and will use from now on) the notation A// for /(^a+i) ^ f{tk)- 

We can now define the stochastic integral of any locally square integrable 
process Ft by approximating it with simple processes. If F" — >■ -F* in the sense 
of proposition 2.22, then one can show, (and this is not entirely trivial [HP84]), 
that 

hm F^dAt 

n— )-cso 
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converges on vectors of the type u e(/). We will take this to be our stochastic 
integral J* FtdAt (and similarly for J^FfdAl). We introduce the 'stochastic 
differential equation' dM^ = FtdAf + GtdAl + Hfdt as shorthand notation for 
the integral equation Mt = Mq + FgdAg + GgdAl + /J Hgds. 

A Quantum Version of Ito's Formula 

We can now prove a noncommutative version of Ito's formula for stochastic 

integrals of continuous processes. 

Theorem 2.23 (Hudson, Parthasarathy) Let Mt and M[ he integrals of 
continuous processes, i.e. dMt = FfdAt + GtdAl + Hfdt and dM[ = F{.dAt + 
G'^dAl + H'^dt. Suppose that the products between the operators Mt, Ft,Gt, Hf 
and M'g,F^,G'g,H'g are well defined on IC ® e{S). Then the adapted process 
t MfMl satisfies d{MtM[) = d{Mt)M[+Mtd{M'^)+dMtdM[ , where dMtdM[ 
should he evaluated according to the following quantum ltd tahle. 



X 


dt 


dAt 


dA\ 


dt 











dAt 








dt 


dA\ 












In other words, 

d{MtM't) = {FtM't+MtF't)dAtHGtM't+MtG't)dA\ + {HtM't+MtH't+FtG't)dt . 

Proof We expand A{MtMl) = (AMt)M; + Mt{AMl) + AMtAM^, and focus 
on the last term. In the limit At — )• 0, this equals 

AMtAMi = FtFliAAtf + GtG'tiAAlf + HtH'.iAtf 
+FtG'tAAtAAl + {FtH't + HtF^)AAtAt 
+GtF^AJ^AAt + {GtH't + HtG't)AA\At . 

Each one of these 7 terms is a product of a term in A/j and one in AA*, so that we 
can split their contributions. For example, take FtG'tAAtAA\. Consider it as 
FtG't®lj^t xl^^j,^® AAtAA\, B.nd split M(g)e(/i,) = {u®e{hl]fi^t\))®(^{h\t,oo]) 
accordingly. We see that 

{u ® e{h) I FtG'tAAtAAlu' ® e{h')) 

equals 

{u O e{hl[o,t]) I FtG't u' ® e{h'l[o,t])) 

times 

(e(W[t,oo)) I a(l[tfc,t,+i])a^(l[tfc,tfc+i])e(/j'l[t,oo))) • 
A look at equation (2.20) reveals that this second factor equals 

I (^^ h'{s)ds^ (^^ h{s)ds^ + Aij (e(/il[t,oo)) I e(/i'l[t,oo))) • 
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As h' and h are locally bounded, the first term is quadratic in At. If we take 

M 0, we are left with (w, ® e{h) \ FtG[ u' (g) e{h'))At. The other 6 terms are 
of second order, as equations (2.17), (2.18), (2.19) lack the extra {ui | U2) term 
of equation (2.20). □ 



Quantum Stochaistic Differential Equations 

Let Vi , V2 and V3 be bounded operators on /C. We can then define the quantum 
stochastic differential equation 

dMt = Mt{VidAt + V2dAl+V3dt) (2.29) 

as shorthand for the quantum integral equation 

Mt = Mo+ [ Mt{VidAt + V2dA\ + V^dt) . 
Jo 

Theorem 2.24 Equation (2.29) has a unique continuous solution with Mq = 1. 
It is unitary if and only if (Vi, V2, V3) takes the shape {V\—V, iH — ^VW). 

Proof. The existence of a unique solution can be shown by means of a Picard- 
type iteration procedure, see [HP84]. The solution is unitary, M^Mi ~ 1, if 
and only if d{MjMt) = 0. Using theorem 2.23, we calculate this by adding 
M^dMt = VidAt + V2dAl + V^dt, d{Ml)Mt = V^dA\ + V^dAt + V^dt and 
d{Mj)d{Mt) = V^V2dt. The result is identically zero if and only if Vi =0 
and Vs + V^^ + ¥^¥2 = 0. □ 

If we think of J-"(L^(M"'")) as the Hilbert space of the electromagnetic field, and 
of K. as the Hilbert space of an atom coupled to this field, then time evolution 
is described by the Schrodinger equation 

if^U^it) = {Hs^1 + 1®Hf + XHi)U^{t) 

on /C (g) J^(L^(M+)). In this expression, Hs is the Hamiltonian of the system /C, 
Hp is the the free field Hamiltonian on J^(L^(K+)), and Hj = i{D (g) a^{g) — 
(g) a{g)) is the interaction Hamiltonian on /C (g) J^(Z/^(M"'")) with coupling 
constant A. 

Since U^{t) can often be easily solved, it suffices to find the interaction pic- 
ture time evolution U^{t) := U^{t)U"{-t). The 'van Hove' limit of U^{t/\'^) for 
A I exists under suitable conditions. According to a theorem of Accardi, Frige- 
rio and Lu, see [AFL90], it is given by a unitary quantum stochastic differential 
equation of the form 

dUt = UtivUAt - VdA\ - Iv'^Vdt) . 
This equation will play a pivotal role in chapters 5 and 6. 
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2.4 Bundles and Classical Fields 

We have reviewed the foundations of quantum mechanics, and scratched the 
surface of a tiny part of quantum field theory. We will now take a step back, 
and submit the basic framework of classical field theory to a closer look. 

In classical field theory, a system is modelled by a smooth fibre bundle 
TT : F ^ M, and fields are described by sections of tt. The base manifold M 
represents space-time. The dynamics are prescribed by a Lagrangian density 
£ : J^F A"T*M, with n the dimension of M. Transformations of the theory 
correspond to automorphisms of F. 

2.4.1 Bundles, Sections and Jets 

Intuitively speaking, a fibre bundle over M with fibre Fq is a smoothly varying 
family {Fj. ; x € M} of copies of Fq. A section is a function (j) : M ^ F such 
that its value on x G M lies in the fibre Fx over x. The fc-jet j^cf) of (f> a,t x is its 
value <j){x) at x, together with its derivatives at x up to order k. Let us make 
this a bit more precise. 

Fibre Bundles 

If F, F', M and M' are smooth manifolds, then an isomorphism between the 
maps n : F ^ M and tt' : i^' — > M' is by definition a pair (</>, 0m) of diffeomor- 
phisms (f) : F ^ F' and 0m : M — >■ M' such that tt' o = 0m o tt. Since 0m 
is completely determined by 0, we will suppress it in the notation, and write 
for (0, 0m)- If M = M', then is called vertical if 0m = idM- 

Definition 2.14 Let F , M and Fq be smooth manifolds. Then a fibre bundle 
TT : F — >■ M with fibre Fq is a smooth map with the property that each x € M has 
an open neighbourhood U such that the restriction w : 7r~^(?7) U is vertically 
isomorphic to the projection it' : U x Fq ^ U on the first factor. 

In short, F is locally the product of the base by Fq. A vertical isomorphism : 

TT~^{U) — > t/ X _Fo is called a local trivialization. The manifold Fx := 7r~^({x}) 
is called the fibre over x. Each local trivialization restricts to a diffeomorphism 
?!'|f^ : Fx — > {x} X Fq, so that in particular, any fibre Fx is difi^eomorphic to Fq. 

For any fibre bundle F — > M, we can produce adapted co-ordinates as fol- 
lows. Choose co-ordinates x^^ : M D U —?' M. on M, co-ordinates v°- : Fq Z) 
y — >■ M on Fq, and local trivializations 0;/ : Tr^^{U) U x Fq. The adapted 
co-ordinates on F are then (x^^ o <j)u,v'^ o 0[/). We will simply denote them 
{x^,v°'). They have the convenient property that x'^f) = x^if) for all /x if 
and only if / and /' are in the same fibre. 

Finally, if F -> M is any fibre bundle, and f : N M is a. smooth map, 
then we can define the puUback bundle f*F^Nas the manifold f*F = 
{(n, /) e N X F; f{n) = 7r(/)}, with projection (n, /) >-)■ n. The idea is that 
the map / allows one to regard the fibre over /(n) e M as a fibre over n G N, 

i-e. f*Fn = Ff(^n)- 
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Vector Bundles and Principal Fibre Bundles 

It is useful to study fibre bundles with additional structure on the fibres. A 

fibre bundle tt : E ^ M is called a vector bundle if Ea and each of the fibres 
E^ are vector spaces, and if one can choose the local trivializations </> over U in 
such a way that for any x G M, the map (f)\E^ : Eq is an isomorphism 

of vector spaces. Similarly, a fibre bundle tt : P ^ M is called a principal fibre 
bundle with Lie group G if Pq and the fibres P^ carry a free transitive right 
action of G, and if the local trivializations can be chosen to be G-equivariant. 

The tangent bundle is a natural example of a vector bundle. The tangent 
space Ta,M of M at a; € M is by definition the space of vectors tangent to M 
at a;. If M is n-dimensional, then T^M is a vector space isomorphic to M". 
This makes the tangent bundle TM = UxemT^M into a vector bundle, with 
projection tt : TM — )• M assigning to each tangent vector G T^M its base 
point 7r(^j,) = X. 

Starting from the tangent bundle, one can construct many other interesting 
fibre bundles. If tt : f — >■ M is a any fibre bundle, then the kernel Tj" F of the 
pushforward tt* : TfF T^f^f^M is a linear subspace of TfF, and we obtain 
the vertical tangent bundle F F, a. vector subbundle of TF — > F. One 
can think of it as the union of all the separate tangent spaces T[Fx). 

The cotangent bundle tt : T*M M is the vector bundle defined by letting 
T*M be the dual of T^M, i.e. the vector space of linear maps T^M — > K. 

Similarly, a volume form on the vector space T^M is an alternating mul- 
tiUnear map T^M x ... x T^M — )• M on n copies of T^M. We denote the 1- 
dimcnsional space of volume forms on T^M by A"T*M. Their union A"T*M = 
Ua:eM A"r^M is a line bundle; a vector bundle over M with 1-dimensional fibre. 

The frame bundle F{M) — )• M is a natural example of a principal fibre 
bundle. A frame at x is by definition a linear isomorphism / : R" Tx{M). 
If we fix the standard basis (ei, . . . , e„) of M", then frames f at x correspond 
precisely to bases (/(ei), . . . , f{en)) of Tx{M). The space of frames at x, denoted 
Fx{M), has a free transitive right action g : / i— >■ f o g oi GL(M"), making 
F{M) = UxeMFxi^) into a principal fibre bundle. 

Sections 

A section of tt : F ^ M is a smooth map <j> : M ^ F such that tt o = idM, 
and we denote the space of sections by T{F). 

If F happens to be the trivial bundle M x i^oi then sections correspond 
precisely to functions M ~> Fq. One way to view sections and bundles is 
as follows. Take the base manifold M, and cut it into parts Ut that overlap 
only at their boundaries, and that arc small enough to fit into a trivializing 
neighbourhood. A section (/> of F is then a collection of functions (pi -.Ui ^ Fq, 
and the bundle F ^ M encodes the boundary conditions that and have 
to satisfy on dUi n dUj . 

A section of the tangent bundle TM ^ M is precisely a smooth vector field 
on M, so that the commutator bracket x] of vector fields endows T(TM) 
with a structure of Lie algebra. A section of T*M is called a 1-form, and a 
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section of A"T*M a volume form. 

The tangent bundle TS^ — > 5^ of the 2-sphere is a nice nice example of a 
vector bundle that is not trivial. Every vector field on S"^ must have at least one 
zero,^ showing that a section of TS'^ — >■ 5^ is not quite the same as a function 

Jet Bundles 

Intuitively speaking, the fc-jet of a section at a; is its value 4'{x), together with 

its derivatives at x up to order k. 

To put this in a more formal setting, we introduce the relation ~ on the 
local sections around a;, where ~ if ^ and (j)' have the same value at x, and 
if their derivatives agree up to order k. The A;-jet of (j) at x, denoted j^cf), is now 
simply the equivalence class of </> modulo ~. 

We denote the set of all fc-jets at x by J^F, and we define J^'F := Ux^mJ^P 
to be the jet bundle. It is a fibre bundle in two different ways. The map 
TTfe : J'^F — > M defined by j^((/>) ^ x makes J'^F into a bundle over M, and 
the map -Kkfi ■ J''F — )• F that maps j^((/>) to (f){x) makes it into a bundle over 
F. We have tt o nk.o = T^k- Any section (f> oi it : F ^ M yields a section j^cj) of 
TTfe : J'^F — M, but not all sections of J'^F are of this form. 

Adapted co-ordinates (x^, v") on F give rise to the co-ordinates {x^, v"',v^) 
on J^F by .T^'Ci^f/)) = .^^(.t), v^ijlcj)) = and = 9p^"((?!)(x)). If 

is a section of F, then we denote 0" := v"" o </>, so that 9^0" = f^O'^f/))- 

2.4.2 The Euler-Lagrange Equation 

Having described its basic ingredients, we now have a closer look at classical 
field theory itself. A physical system is described by a fibre bundle tt : F — > M, 
the base manifold Af representing space-time. Fields are modelled by sections 
of TT, and the dynamics are prescribed by a bundle map C : J^F A"'{T* M) 
called'' the Lagrangian density, or Lagrangian for short. It singles out a class of 
preferred sections by means of the action principle. These are considered to be 
the fields of physical relevance. 

The Action Principle 

Let U be an open submanifold of M with smooth boundary and compact closure. 
We define the action functional Su : T{F) ^ M by Sui(l)) = J^j^<t>*C. The 
action principle then states that the physical fields are precisely the critical 
points of the action Su under infinitesimal variations with support inside U. 

An infinitesimal variation of a section ^ e T{F) is precisely a section 6<p of 
(j}*T^ F, the puUback of the vertical tangent bundle of F along (j). Indeed, if (jj^ 

®The theorem of P. Bohl, J. Hadamard and L. E. J. Brouwer that every (continuous) vector 
field on a sphere of even dimension must have at least one zero is known colloquially as the 
'hairy ball theorem': one cannot comb the hair on a 2n-sphere without creating a crown. 

^Generalization to Lagrangian densities on J^F with A; > 1 is straightforward, and the 

Euler-Lagrangc equation then becomes a PDE of order k + 1. We focus on the case fc = 1 for 
notational convenience, but also because most fundamental field equations are of order two. 
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is a one-parameter family of sections of F with (j)o = (j), then (5^(a;) = ^lo^^e 
yields a vertical vector in T^(x)F for each x G M. 

If we choose adapted co-ordinates (a;^, f", uj^), then we can write the La- 
grangian density as £ = L{x^, v'^, v^)vol, with vol = dx^ A ... A rfa;" the volume 
form assigning unit volume to the frame (gfr, • • • , gfjr), and L the so-called 
Lagrangian function. The action then reads Su{(t>) = Ju L{x^ ,dij,(l)°')\o\. 

In order to avoid a jungle of sigmas, we will adopt the convention that 
repeated indices indicate summation. We also use the shorthand notation 
for X I-)- ^{x^^{x),(j>°'{x),d^(j)°'{x)), and similarly, we write g^g^a) for the 
function X ^ §^^{x>'{x),r{x),d^r{x))- 

The following theorem shows that critical sections correspond to solutions 
of a second order P.D.E. called the Euler-Lagrange equation. 

Theorem 2.25 (Euler-Lagrange) A section (p of F is a stationary point of 
the action functionals Sjj under variations d(p with support strictly contained in 
U if and only if it satisfies the second order P.D.E. 

[£]"^ = 0, (2.30) 

called the Euler-Lagrange equation. The map [C]'^ : (j)*T^F A"T*M of 
vector bundles is well defined by 

The r.h.s. of (2.31) does not depend on the choice of co-ordinates, whether (j) is 

critical or not. 



Proof. Since -^^(x) and Q(jp'^{x) only depend on (j) through j^cj), we see from 

(2.31) that [£]'^(a;) depends only on </> through so that (2.30) is indeed a 
P.D.E. of order 2. 

The variation of So along = 5<f)°'da reads 

''^^^^=X(l^'^"+a(£^''^^")^°^- 

Note that 5d^<j)°- = 9^50", because ■^\od^(j)'^ = d^-^\o(j)'^. We can then write 

If we now define [C Y' : (j)*T^ F -S- A"T*M by 

[£]^<5</.) = ((d,j-^) - ^ ) yol-Sr (2.32) 



'a(5^0«) 

and : ^*T^ F K^'^T^M by 



{^}' W) = dio;:^)'^^'^^^ ■ ' (2.33) 
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then we have 

5Su{cP) = - [ [C]'''{6cP)+ [ {C}'^{6<l>). (2.34) 

Ju JdU 

It can be seen from (2.34) that [jC]'^ and {C}'^ do not depend on our choice of 

co-ordinates, a fact not at all obvious from equations (2.31) and (2.33). 

The argument runs as follows. Let [£]^ and be two versions of (2.31), 
defined w.r.t different co-ordinate systems. Let 5(f) be any section of (j)*T^ (F) 
that vanishes on dU. Then the second term on the r.h.s. of (2.34) vanishes, 
and we have both SSu{(t)) = - J^jiCfjScf)) and SSuift)) = - Iu[^]U^(t})- This 
means that Ju{[^]a ~ [-^ ]b)(<5'/') = for all infinitesimal variations vanishing 
at dU, so that = [C]g holds over the interior of U. But since U was 

arbitrary, equality must hold over all of M. 

We can now show that also {L}'^ does not depend on the choice of co- 
ordinates. Let d(f> be any section of (j)*T^{F), not necessarily vanishing at dU. 
According to equation (2.34), both Jgjj{£}'^{S(p) and /g^{'C}^((5(/)) arc equal to 
SSu((f>) + j\j[C]'^{S(t>). (Wc just established that (Scf)) is the same for A and 
B.) Once again, /^j^{£}^((5(/)) — {£}g{6(p) = for arbitrary 5(p and U implies 

that {£}^ = {£1^ must hold over all of M. 

It is clear from (2.34) that Su is stationary at <i> under variations dd> with 
support inside U if and only if J^[£]'^{6(j)) is zero for all such variations. This 
is the case if and only if [£]'^ is identically zero. □ 



Example: the Klein-Gordon Equation 

One of the simplest examples of an Euler-Lagrange equation the following. Let 
M = R'^, and equip it with the Minkowski metric rj of signature (—,+,+,+). In 
co-ordinates (a;°, x^, x^, x^), this reads 77(^, x) = VtJ.i^^'^X'^ y with 77^^ the diagonal 
matrix diag( — 1, 1, 1, 1). Wc denote = Vfj-uC^ , so that = "n^^^v- 

Let F be the trivial bundle M'' x C ^ M''. which comes with co-ordinates 
(x^; v""). The index fi runs from to 3, and a runs from to 1. (We write z G C 
as z = + iv^.) We define the Lagrangian 

>C(jV) = (^^'^'/'a^M'/'" - f vol. 

We have = m^^ and g(g"^a) = fn~^d'^(j)a-, so that the Euler-Lagrange 
equation in this case is the Klein Gordon equation 

It allows for plane wave solutions (f){x) = exp{knX^) with = m^. 

Note that certainly not all Euler-Lagrange equations are linear. For in- 
stance, if F is the bundle x M ^ M^, rj is of signature (-I-, — ), and £{j^(j>) = 
\dfj,<j)d^cf) — (1 — cos(0)), then the resulting Euler-Lagrange equation d^d^cf) = 
— sin((/I)), rather childishly called the sine-Gordon equation, is a nonlinear P.D.E. 
with some bearing on surfaces of constant negative curvature. 
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Example: Electromagnetism 

The most classical of all field theories is electromagnetism. The base manifold 

M is again with the Minkowski metric, but the bundle F is now the cotangent 
bundle T*M ^ M. This is a trivial vector bundle for M = W^. If dx"" is the dual 
of dx" in T*M, then a global trivialization is given by describing p^dx" G T*M 
with the co-ordinates (x'^^pi,). 

The basic field in electromagnetism is a 1-form A S r(T*M) called the elec- 
tromagnetic four-potential. We set p^{A{x)) = A^{x), and write A = Ai^{x)dx^. 
We define the antisymmetric tensor _F),,y := d/^Aiy — d^A^, so that F = 2dA, 
with F = F^j,dx^ A dx'^ . The (vacuum) EM-Lagrangian is then 

C = -\f>^^F^,vo\. 

We have -g§^ = —F^^^, and = 0. The Euler-Lagrange equation is then 
the (vacuum) Maxwell equation 

^^F^"' = . 

The 3 components Ei = FiQ arc identified with the electric field, and the 3 
components Bi = \eijkFjk with the magnetic field. Light is described by the 
plane wave solutions Afj_{x) = a^exp{ikfjtX'^) with k^k'^ = 0, a^fc** = and, at 
least in the Coulomb gauge, ao = 0. 

2.4.3 Transformations and Symmetries 

Transformations of the system correspond to automorphisms of the fibre bundle. 
An automorphism of tt : ^ M is a pair {a, um) with a a diffeomorphism of 
the total space F, aM a diffeomorphism of the base M, and w o a = au ° tt. 
If F is a vector bundle, we additionally require that a be linear on the fibres, 
and if F is a principal fibre bundle, we require a to be equivariant w.r.t. the 
group action. Because aM is completely determined by a, we drop it from the 
notation, and write a instead of {a,aM)- An automorphism is called vertical if 
aM = id. 

The automorphisms of F form the group Aut(i^), and the vertical automor- 
phisms Aut^(i^) form a normal subgroup. If we denote the diffeomorphism 
group of M by Diff(M), then a i— >■ aM is a homomorphism from Aut(F) to 
Diff(M) with kernel Aut^(i^). Its image Diff^(M), the group of liftable diffeo- 
morphisms, is an open subgroup of Diff(A'/). We have the exact sequence 

1 ^ Aut^(^^) ^ Aut(F) ^ Diff^(M) ^ 1 . (2.35) 

The group Aut(F) of transformations acts on the space T{F) of fields by 
(j) I— >■ a o (j) o a]^. Note however that without any additional structure, the 
diffeomorphism group Diff(M) does not have a natural action on the fields 
T{F). The extra structure that is required to have Diff^(M) act on the fields 
r(F) is precisely a homomorphism E : Diff^(A'/) Aut(F) splitting (2.35). 

An automorphism a of _F acts on J^F by j^a : j\<p h- > i\{ao ct)o a^/). This 
means that a takes the Lagrangian C : J^F — >■ A"T*M into a*j^ o £, o j^a. 
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Definition 2.15 An automorphism a of F is called a symmetry of the La- 

grangian C : J^F A"T*M i/£ = o Coj^a. 

This is a sensible definition. If ^ is a local section of F over U, then ao(j)o a'^j 
is a local section over a{U). One readily checks that if a is a symmetry of £, 
then Suid)) = Sa{u){'^ o (po aj^). In particular, ao cj) o is a critical section 
if and only if (j) is, and the action of a on T{F) restricts to the solution space 
^el{F) of the Euler-Lagrange equation. 

Infinitesimal Symmetries 

It will be expedient to look at this picture from the infinitesimal point of view. 
The Lie algebra of Diff (M) is, at least morally speaking, the Lie algebra T{TM) 
of vector fields on M . The group Aut(F) corresponds to the Lie algebra T^{TF) 
of projectable vector fields on F, i.e. vector fields ^ G T{TF) such that tt^^j = 
7r4/. whenever 7r(/) = 7r(/'). The pushforward 7r» : r^(Ti^) -> T{TM) is not 
only well defined, but also a homomorphism of Lie algebras. Its kernel, the 
ideal (TF) of vertical vector fields, corresponds to the group Aut^(F), and 
we have the exact sequence of Lie algebras 

— > r^(TF) — > r^{TF) ^ T{TM) — > . (2.36) 

The infinitesimal variation Sep G T{(j)*T^ F) of cf) under ^ e r^(rF) is given 
by 64)x = ^lo^e o 0o "^m(2;); "^'itli = cxp(eC) and a^^M = exp(e7r*^) the 
induced flows on F and M respectively. It is readily seen to be 5(t>x — d^4'{^4>{x)), 
with d'"4'{^) = ^ — (f)^,n^^ the vertical projection T^(x)F — > T^(x)F along the 
image of : T^M T^(x)F. 

In order to obtain the infinitesimal version of definition 2.15, we require that 
^ locc* j^Loj^a^ vanish. Variation of a* gives rise to a term L^^^£(j^((/))) = 
d{iT^^^C{j^(j))). Variation of £ through the variation -^ote o(j)o a~\j of the field 
gives rise to an additional term —[C]'^{5(f) + d{{C}'^{54))), cf. equation (2.34), 
with 5(f)x = d'" (f>{^fj,(^x)) ■ This leads us to the following definition of infinitesimal 
symmetries. 

Definition 2.16 A projectable vector field ^ G r^{TF) is called an infinitesi- 
mal symmetry of the Lagrangian C : J^F ^ A^T* M if 

-[c]Hd^m) + d{i.,^c + {ri^dXO) = . 

From this definition, we immediately sec that infinitesimal symmetries give 
rise to conserved quantities. If we define the Noether current associated to the 
infinitesimal symmetry ^ and the field <j) to be the (n — l)-form 

J"^(0 = {C}t'{d^m) + i^.«'C(j^(0)) , (2.37) 
then the following theorem follows straight from the definitions. 

Theorem 2.26 (Noether) If (p is a critical point of the action, and ^ is an 

infinitesimal symmetry, then dj'^{£_) — 0. 

Proof Wc have dC = -[C]'''{d"(t){^) + dJ'^iO- Now 5C = because ^ is an 
iiifinitesiinal symmetry, and = by the Euler-Lagrange equation. □ 
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The rationale behind considering closed (n— l)-forms as conserved quantities is 
the following. Let t be a (local) time co-ordinate, and let S be a compact (n— 1)- 
dimensional submanifold with boundary of the (local) time-slice t = Iq. If we 
interpret Qe = Js -^"^(O as the 'charge' of S, then = J^Cg^ J '''{(,), with 

Cq^ the Lie derivative along dt- Now C^tJ'^iO = dig^J'^iS,) because dJ'^{£^) = 0, 
and didfJ'^iS,) = Jq-^ idt^'^iO by Gaufi' theorem. If we now interpret = 
Jg^idtJ'^iO as the flux through 9S, then ^Qe = ^as says that the rate of 
change of the charge of S is equal to the flux through its boundary. 

Strictly speaking then, Qs is not a conserved quantity at all; we merely know 
that its rate of change can be attributed to a flux. However, if the current goes 
to zero fast enough at spatial inflnity, then the total 'charge' of the time-slice 
t = to is conserved. 

Example: the Current of a Klein-Gordon Field 

The Klein-Gordon Lagrangian allows the one-parameter group of vertical ro- 
tations ag : (.T^ ) n- (.t''; cos(6')w'' + sin(6')f ^, — sin((?)w'' + cos{0)v^) as a 
group of symmetries. The corresponding inflnitesimal symmetry is the vector 
fleld ^ = v^-^ — v^-£ji. We calculate the corresponding current from equa- 
tion (2.37). Because ^ is vertical, tt^ = and d"(?!)(^) = ^. Since = 
^^^'(i}ai^^yo\, we get J<^(0 = ^((/>°a''(/>i - 4^^^' 4P)io ^yo\. It is not hard to 
check 'by hand' that dij,J^{^) = for solutions of the Klein-Gordon equation, 
as predicted by Emmy Noether. 

Transformations and Symmetries of Space-Time 

We would like to associate conserved quantities to inflnitesimal symmetries of 
space-time. However, in order to even deflne what a space-time symmetry is, 
we need to know how Diff(M) acts on the flelds. In other words, we need a 
homomorphism S : Diff^(M) — >■ Aut(i^) that splits (2.35). A transformation 
aM S Diff (M) is then called a space-time symmetry of £ if E(q:m) is a symmetry 
in the sense of deflnition 2.15. 

An inflnitesimal transformation of space-time is a vector fleld £,m G r(TM). 
In order to describe its action on the fields, we need a splitting a : r(TM) 
T^{TF) of the exact sequence (2.36) of Lie-algebras. A vector fleld e T{TM) 
is called an inflnitesimal space-time symmetry of C if cr(^M) is a symmetry in 
the sense of definition 2.16. 

This Lie algebra homomorphism a : T{TM) r^{TF) will be the object of 
our attention in chapter 7. We will argue there that it is of more fundamental 
importance than its counterpart S : Diff(M) Aut(F) at the level of groups. 

Although every flat connection gives rise to a splitting of (2.36), it is certainly 
not true that every splitting arises in this way. However, in chapter 7, proposi- 
tion 7.7 and corollary 7.17, we will prove that under reasonable assumptions^, 
a must be a differential operator of finite order k. 



^Namely, wc assume tliat tlic fibre bundle F is associated to a principal fibre bundle, and 
that the infinitesimal transformations respect this association. 
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This means that the map of sections a : V{TM) — >■ V^iTF) comes from a 
map V : tt* J^TM — > TF of bundles over F. In local co-ordinates, there then 

exist functions C"'''^'"''' (x^, w") such that 

<y{i^d^) = ^d^ + (C;r + C^-^'dp^ + ■■■ + C^'P^ -'-dp, . . . d,,e) da . 
Note that the case k = corresponds to a flat connection. 

The Stress-Energy-Momentum Tensor 

Once we have a splitting a, it is tempting to define the associated conserved 
current as J'^{a{^M))- Appealing as this may seem, it is not the proper course 
of action in the presence of a suSiciently large group of space-time symmetries, 
as noted first in [Noel 8]. 

Instead, one can use a in order to construct a Stress-Energy-Momentum 
(SEM) tensor as follows. Choose adapted co-ordinates, and expand 

/ FIT BT \ 

into 

5C = TrC^oX + TPrdpC^oX +... + TP'-P^+'dp, . . . a^,+i^rVol , (2.38) 
using S(f) = d'''(t>{a{£,M)), i-e. 

= (c; - drr)e + c^'^dpC + ■■■ + c^-^p-'-'P'-dp, . . . dp,c ■ 

The tensor density T''^((/))vol is then the desired SEM-tcnsor. If S is a 
compact (n — l)-dimensional submanifold with boundary of the equal-time slice 
t = to, then for r equal to zero, the quantity Pr = J^. T''^(0)ia^vol is interpreted 
as the energy of the flelds in S. For r equal to 1 through n — 1, it is interpreted 
as the momentum. 

The problem with this deflnition is of course that the tensor density T^^(<;i)vol 

may well depend on the choice of co-ordinates. However, if C is diffeomorphism 
invariant, then (5£ = in equation (2.38) for any choice of ^ S r(TM), forcing 
aU the TP'^-P'{4>) to vanish identically 

Example: the SEM-Tensor of Electromagnetism 

The tangent bundle comes with a natural lift of Diff(M). For any diffeomor- 
phism a, the pushforward a* maps T^M to Ta{x)M, and is therefore an au- 
tomorphism of TM covering a. The lift S(q:) = a, is a homomorphism of 
groups. 

Similarly, the exact sequence (2.35) for the cotangent bundle T*M is split by 
the inverse pullback Tj{a) = (a^^)* , considered as a map Diff(M) A\it{T* M). 
Its infinitesimal version is the Lie algebra homomorphism o"(^) = 6xp(— £^)*, 
which reads cr(^^9^) = ^^9^ — Pa^vS," ^ in local co-ordinates. 
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The infinitesimal variation 6 A G T{A*{T^T*M)) oi A G r{T*M) along a{^) 
reads SA = d'"A{a{£,)) = <j{£,)a(x) — A^,£^. In local co-ordinates, this is 5A = 
{^d^ -AM'^^)- ieO^ + ^-d,A,^),i.e.5A = - {AM" + CdM4:- 
If we identify A*(T^T*M) with T*M, which we can do because T*M happens 
to be a vector bundle, then we simply have 6 A = ^C^A. 

Now that we know that action of r{TM) on r(T*M) is the Lie derivative 
- What else could it have been? - calculating the SEM-tensor is straightforward. 
We expand SC = -{F'^'^d^SA^ - ldr{Fi^''F^^C))^ol into derivatives of ^. 

5£ = (f^^9^(ra.A,+^r5,r)-i5.(F'^''^;.r))voi 

= {F'^'-d^drA, - ia,(F''-F^.))rvol 

+ ArF^'-'d^^YOl 

= - ldr{F'^''F^,)eyo\ 

+ {F^^^F,, - \F^^F^J\) a^rvol 

In the last step, we used that F^^ is antisymmetric, and that = 
—F'^^d^Ardu^'^. According to our definition, the SEM-tensor of electromag- 
net ism is 

T^(A) = F'^-'Fr. - \F^''F^J^r ■ (2.39) 

In particular, the energy density T°° of the EM-field is \{\E\'^ + I^Hvol. 

Remark The above tensor density is co-ordinate independent, which, in our 
limited setup, is a small miracle. As ever so often, the miracle disappears if we 
broaden our view a little. The transformation behaviour S : Diff (M) — > Aut(F) 
of the EM-field allows us to couple it to a metric g^''' , resulting in a Lagrangian 
that equals the one above for g = rj, but is Diff(M)-invariant because the trans- 
formation of the metric cancels that of the fields. The SEM-tensor then vanishes 
identically. It is the sum of a term resulting from SA, and an equal term of op- 
posite sign resulting from Sg. The former term was calculated above, and the 
latter term can be seen to represent a bona fide tensor density from general 
principles. See e.g. [FR04] for details. 

Remark Our definition of the SEM-tensor differs from the one in [GM92] 
and [FR04] by a term [C]aC^''^ which vanishes on shell, i.e. for solutions of the 
Euler-Lagrange equations. Consequently, the above expression for the SEM- 
tensor is correct on shell as well as off shell, whereas the derivation in [GM92] 
is valid only on shell. The term [C]aC!^'^ does appear in Emily Noether's sem- 
inal paper [Noel8], but her aim is shghtly different from ours. She expresses 
d(J'^(^) — [i3]QC"''^'^vol) in terms of [C]'^ and its derivatives, showing that the 
Euler-Lagrange equations are linearly dependent in the presence of symmetry. 
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2.4.4 Infinitesimally Natural Bundles 

The point of deriving (2.39) is not to acquaint the reader of the precise form 

of the electromagnetic SEM-tensor. It will not occur in this thesis again, and 
could be found in a myriad of introductory books on electrodynamics if it did. 
The point is to stress the crucial role that the splitting a : T{TM) T^{TF) 
of (2.36) plays in its derivation. 

Note that in order to derive equation (2.39), we needed to use the nontrivial 
splitting derived from the puUback, even though T*R'^ is trivial as a 

vector bundle! Had wc used the trivial splitting ^(C^^^) = that comes 

from the global trivialization of T*M, we would have obtained the incorrect 
SEM-tensor 

which differs from the correct one by a term d(F'^'^ AT-ig^^voV) — (9^F^'^)At-vo1. 
This would increase the electromagnetic energy of a 3-volume S C K"^ by a 
boundary term J^^ AqE ■ dS, which is accessible to experiment. It is known to 
be incorrect. 

We conclude that a classical field theory is not fully specified by a fibre 
bundle F — )• M and a Lagrangian density £ : J^F — >■ A"T*M. In order to 
derive the correct SEM-tensor, one has to specify the reaction of the fields to 
infinitesimal space-time transformations, that is one has to specify a Lie algebra 
homomorphism a : T{TM) T^{TF) splitting the exact sequence (2.36). 

Even if one has no interest in the SEM-tensor whatsoever, this is a very 
natural thing to ask for. It is hard to imagine fields on space-time without a 
well defined transformation behaviour under infinitesimal space-time transfor- 
mations. For one thing, such fields would have ill-defined spin. 

We therefore consider the splitting cr : T{TM) — >• r^{TF) as part of the 
input data of a classical field theory, on equal footing with the bundle tt : F — >■ M 
and the Lagrangian £ : J^F ^ A"r*Af. This poses an obvious restriction 
on the fibre bundle F — )■ M, namely that it has to admit at least one such 
splitting. We will call such fibre bundles infinitesimally natural. In chapter 7, 
we will classify these infinitesimally natural bundles- or at least the ones with 
sufficiently structured fibres. 
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Information Transfer Implies 
State Collapse 

We attempt to clarify certain puzzles concerning state collapse and decoherence. 
In open quantum systems decoherence is shown to be a necessary consequence 
of the transfer of information to the outside; wc prove an upper bound for 
the amount of coherence which can survive such a transfer. We claim that in 
large closed systems decoherence has never been observed, but we will show 
that it is usually harmless to assume its occurrence. An independent postulate 
of state collapse over and above Schrodinger's equation and the probability 
interpretation of quantum states, is shown to be redundant. 

3.1 Introduction 

In its most basic formulation, quantum theory encodes the preparation of a 
system in a pure quantum state, a unit vector ijj in a Hilbert space "H. Observ- 
ables are modelled by (say, nondegenerate) self-adjoint operators on H. The 
expectation value of an observable A in a state V is given by {ip \ Aip). If a is 
an eigenvalue of A and ipa a unit eigenvector, and information concerning A is 
somehow extracted from the system, then the probability for the value a to be 
observed is \{ipa I V")!^- If this observation is indeed made, then the subsequent 
behaviour of the system is predicted using the pure state ijja- This is called state 
collapse. It follows that, if the information extraction has taken place but the 
information on the value of A is disregarded, then the subsequent behaviour can 
be described optimally using a mixture of cigcnstatcs. This is called decoher- 
ence. In this chapter we substantiate the following claim concerning decoherence 
and state collapse. 

Decoherence is only observed in open systems, where it is a necessary 
consequence of the transfer of information to the outside. 

So the observed occurrence of d(X"oher{Hic{^ does not contradict the Tinitary 
time evolution postulated by quantum mechanics, since open systems do not 
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evolve uniteirily. Decoherence can be explained in quantum theory by embedding 
the quantum system into a larger, closed whole, which in itself evolves unitarily. 
This is well-known (see e.g. [Neu32]). We add the observation that decoherence 
is not only a possibility for an open system, but a necessary consequence of the 
leakage of information out of the system. We prove an inequality relating the 
decoherence between two pure states to the degree in which a decision between 
the two is possible by a measurement outside. This is the content of theorem 3.3 
in section 3.3. 

Also, we have claimed that one does not actually observe decoherence in 
closed macroscopic systems. First of all, most of the systems that are ever ob- 
served arc actually open, since it is extremely difficult to shield large systems 
from interaction. But more to the point, the difference between coherence and 
decoherence can only be seen by measuring some highly exotic 'stray observ- 
ables' which are almost always forbiddingly hard to observe. And indeed, in 
those rare cases where experimenters have succeeded in measuring them, or- 
dinary unitary evolution was found, not decoherence. (See [AN+99], [FP+00], 
[WH+OO].) 

We illustrate the latter point in section 3.4, where we show that the mea- 
surement of two classes of observables can not reveal the difference between 
coherence and decoherence: a class of microscopic observables and a large class 
of macroscopic observables. Take as an example a volume of gas. Microscopic 
observables such as the position of one particular atom in a gas, only relate 
to a small fraction of the system. Macroscopic observables like the center of 
mass of the gas, are the average over a large number of microscopic observables. 
Belonging neither to the macroscopic nor to the microscopic class, the 'stray 
observables' referred to above describe detailed correlations between large num- 
bers of atoms in the gas. This kind of information is experimentally almost 
inaccessible. 

Driving home our point concerning decoherence in closed systems: coher- 
ent superpositions of macroscopically distinguishable states are not the strange 
monsters produced by a quantum theory applied outside its domain. They are, 
on the contrary, everyday occurrences which, however, can not be distinguished 
from the more classical incoherent superpositions in practice, and can therefore 
always be regarded as such. 

3.2 Abstract Information Extraction 

Quantum phenomena are inherently stochastic. This means that, if quantum 
systems are prepared in identical ways, then nevertheless different events may 
be observed. A quantum state describes an ensemble of physical systems, e.g. 
a beam of particles, and is modelled by a normalized trace-class operator p on 
the Hilbert space. The expectation value of an observable A in the state p is 
then tr {pA}. 

An information extraction or measurement on a quantum state is to be con- 
sidered as the partition of such an ensemble into subcnsenibles, each subensem- 
ble corresponding to a measurement outcome. Let us, in the present section. 
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not wonder how the splitting of ensembles can be described by quantum theory, 
but let us see what such an information extraction, if it can be done, will entail 
for the subsequent behaviour of the subensembles. Note that this process may 
serve as part of the preparation for further experiments on the system, so that 
it must again lead to a state. 

3.2.1 Information Extraction 

For simphcity let us assume that only two outcomes can occur, labelled and 
1, say with probabilities po and pi. The ensemble is then split in two parts, 
described by their respective states po and pi. The map 

M : p ^ PoPo ® PiPi (3.1) 

must be normalized, affinc and positive. Indeed, normalization is the property 
that po+Pi = 1, and positivity is the requirement that states must be mapped 
to states. The afRne property entails that for all states p and 6 on the original 
system, and for all A e [0, 1], 

M{Xp + (1 - A)i?) = XM{p) + (1 - A)M(i?) . 

This follows from the physical principle that a system which is prepared in the 
state p with probability A and in the state i9 with probability 1 — A, say by 
tossing a coin, can not be distinguished from a physical system in the state 
Ap + (1 — Xj-d. We emphasize that indeed this is a physical principle, not a 
matter of definitions. It states, for instance, that a bundle of particles having 
50% spin up and 50% spin down can not be distinguished from a bundle having 
50% spin left and 50% spin right. This is a falsifyable statement. 

3.2.2 State Collapse 

The above elementary observations are sufficient to prove that information ex- 
traction implies state collapse. If M distinguishes perfectly between the pure 
states Vo and ipi, then of course po = 1 in case p = \ipo}{'^o\, and pi = 1 if 
p= \tpi){tpi\. 

Proposition 3.1 Let T{'H) denote the space of trace class operators on a Hilhert 
space H, and let the map M : T{H) T{H) ® TiH) : p i-^ Mo{p) © Mi{p) be 
the linear extension of some normalized, affine and positive map on the states. 
Suppose that unit vectors tjjo and ijji exist such that 

M(|Vo)(^o|) = Mo(|^o)(^o|)eO 
M(|Vi)(^i|) = 0eMi(|V'i)(V;i|) . 

Then we have M {{ipo) (tpil) = M(|?/>i)(?/>o|) = 0. 

Proof The positivity of Ad yields M{\ee"^^Jo + i\) {ee''f i>a + > as an 
operator inequality. In particular, the 0-th component must be positive. As 
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Mo(|V'i)(V'i|) = 0) it follows that for all e, ^ e M, we have 

e^Mo{\^Po)M + e (e^'^Mo(|Vo)(V'i|) + e-''^Mo(|Vi)(V'ol)) > 0. 

Taking the limit e 4, yields (e**'Mo(|?Ao)(V^i|) + e-*^Mo(|V'i>(V'o|)) > for 
all e K. In particular for = 0, f ,7r, implying that both Mo(|V'o)(V'i|) 
and Mo(jipi){ipo\) are equal to zero. Exchanging the roles of V'o and -01 in 
the argument above shows that also Mi(|V'o)(V'i|) and Mi(|^i)(^o|) vanish, 
proving the proposition. □ 

We may draw two conclusions from proposition 3.1. The first is that, for all 
= aoli/jo) + ailV'i), we have 

(Mo + Mi)(|V)(^|) = (Mo + Mi)(|ao|'|V'o)(V'o| + |ai|'|^i)(^i|). (3.2) 

In words: for the prediction of events after the splitting of the ensemble in two, 
it no longer matters whether before the splitting the system was in the pure state 

laoV'o +Q:iV'i)(«oV'o +Q;iV'i| or in the mixed state |qoP|V'o)(V'o| + |aiPl^i)(V'i|- 
This phenomenon, which is a direct consequence of the structure (3.1) of the 
measurement process, we will call decoherence. 

The second conclusion from proposition 3.1 is the following. For all lip) = 
aolV'o) + ai|V'i)> we have 

M(|0)(V'|) = |aopMo(|V^o)(V^o|) ® |«i|2Afi(|V'i)(V'i|) • (3.3) 

In words: if an ensemble is split in two parts, then the '0-ensemble' will further 
behave as if the system had been in state V'o instead of ip prior to splitting, and 
the '1-cnsemble' as if it had been in state ijji instead of i/j. This phenomenon 
will be called collapse. 

Throughout this chapter, we will maintain a sharp distinction between the 
collapse M : T(K) ^ T{n) QTiU) and the decoherence (Afo + Mi) : T{U) ^• 
T('H). The former represents the splitting of an ensemble in two parts by means 
of measurement, whereas the latter represents the splitting and subsequent re- 
combination of this ensemble. 

3.3 Open Systems 

A decoherence-mapping (Afg + Mi) : T{H) -> T('H) maps the pure (vec- 
tor) state |ao'0o + Q^i''/'i) (o^oV'o + Q^iV'il and the mixed state |aoPl'0o)(''/'ol + 
|aip|'(/'i)(^/'i| to the same final state. Since unitary maps preserve purity, there 
can not exist a unitary map U : H ^ H such that for all p G T{H): 

{Mo + Mi){p) = UpU* . 

However, according to Schrodinger's equation the development of a closed quan- 
tum system is given by a unitary operator. We conclude that the decoherence 
(3.2) is impossible in a closed system. On the other hand decoherence is a well 
known and experimentally confirmed phenomenon. 
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We will therefore consider open systems, i.e. quantum systems which do not 
obey the Schrodinger equation, but are part of a larger system which does. It has 
often been pointed out (e.g. [Neu32j, [Zur82]) that decoherence can well occur 
in this situation, provided that states are only evaluated on the observables of 
the smaller system. We are more ambitious here: we shall prove that this form 
of 'local' decoherence is not just a possible, but an an unavoidable consequence 
of information-transfer out of the open system. 

3.3.1 Unitciry Information Transfer and Decoherence 

We assume that the open system has Hilbert space H, and that its algebra 
of observables is given by B{^), the bounded operators on %. We may then 
assume that the larger system has Hilbert space K®^, since the only way to 
represent B{7i) on a Hilbert space is in the form A i-^ 1 A [Tak79]. We may 
think^ of B{JC) as the observable algebra of some ancillary system in contact 
with our open quantum system. In this context, H will be referred to as the 
'open system', /C as the 'ancilla' and /C ® "H as the 'closed system'. 

We couple the system to the ancilla during a finite time interval [Q,t]. Let 
T e T(/C) denote the state of the ancilla at time 0, and p e T('H) that of the 
small system. The effect of the interaction is described by a unitary operator 
U : K, ® % — >■ IC (E) H; and the state of the pair at time t is given by the 
density operator U{t (8) p)U* G T(/C (g) H). For convenience, we will define the 
information transfer map T : T{n) T{K, H) by T{p) := U{t p)U* . 

Decoherence 

In the above setup, we are interested in distinguishing whether the open system 
"H was in state or at time 0. This can be done if there exists a 'pointer 
observable' B 1 in the ancilla B{1C) which takes average value 6o in state 
T{\%1>q){%1)q\) and hi in state T{\%l>i){'ijji\). By looking only at the ancilla K. at 
time t, we are then able to gain information on the state of the open system "H 
at time 0. We say that information is transferred from 'H to /C. 

Under these circumstances, we wish to prove that decoherence occurs on the 
open system. We prepare the ground by proving the following lemma. 

Lemma 3.2 . Let z?o, i^i be unit vectors in a Hilbert space C, and let A and B be 
bounded self-adjoint operators on C satisfying \\[A,B]\\ < S\\A\\ ■ \\B\\. For j = 
or 1, let bj := {6j \ BOj) denote the expectation and := {0j \ B^9j) — {9j \ BOj)"^ 
the variance of B in the state Then, if ho ^ h\, 

l(^o|A..)|<Mi^l|A||. 

|0o — Oil 



^Sometimes it may happen, as for instance in fermionic systems, that the observables of 
the ancilla do not all commute with those of the open system. Also the observable algebra on 
K may be smaller than B(K.), but we will neglect these complications here. 
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Proof. Wewnte\{bo-bi){'do\Adi)\as\{'do\{A{B-h)-{B-bo)A+[B,A])A)\. 
Since \\{B - bj)-dj\\'^ = {-dj \ {B - bj)'^'dj) = cr?, we can then use the Cauchy- 
Schwarz inequality to show that 

\{bo-b^){MAd^)\ < \\A\\{a^ + ao)+5\\A\\-\\B\\ . 

□ 

Note that, for (5 = (Tq = fi = 0, lemma 3.2 merely states that commuting op- 
erators respect each other's eigenspaces. We proceed to prove that information 
transfer causes decoherence on the open system. (See [Jan04].) 

Theorem 3.3 Let tpo and tpi be mutually orthogonal unit vectors in a Hilbert 

space %, and let t G T{IC) be a state on a Hilbert space IC. Let U : IC 
K.®U be unitary and define T : T{n) T{1C ® U) by T{p) = U{t ® p)U* . 
Let B be a bounded self-adjoint operator on K ^H, and denote by bj and aj 
its expected value and variance in the state T(\'ipj){'>pj\) for j = 0,1. Suppose 
that bo ^ bi. Then for all ip = aoV'o + ctiipi with |aoP + |c«iP = 1 '^'nd for all 
bounded self-adjoint operators A on JC^H such that \\[A,B]\\ < 6\\A\\ ■ \\B\\, we 
have 



tr(r(|V)(V|)^) - tr(r(|ao|>o)(V'o| + \aif\i>i){i>i\))A) 
S\\B\\+ao + <7i 



< 



\bo-bi\ 



-\\A\\. (3.4) 



Proof. First, we prove (3.4) in the special case that t = \'-f){^\ for some 
vector ip £ K,. We introduce the notation 9j := U{(p (g) tpj). Recall that 
the expectation of B is given by bj = tr(T{\'tpj){^pj\)B'^ , and its variance by 
a] tr {T{\ipj){ijjj\)B'^) - tr'^ {T{\il;j){ipj\)B). In terms of Oj, this reduces to 
bj = {9 J I BOj) and cr^ = (q^ | b^Oj) - {0j \B0j)^. Similarly, the l.h.s. of (3.4) 
equals \a^ai{9Q \ A6i) +a.oal{Oi \ ^^o)!, a quantity bounded by \{9o \ A9i) \ since 
2|ao| • |q!i| < 1. Formula (3.4) is then a direct application of lemma 3.2. 

To reduce the general case to the case above, we note that a non-pure state 
T can always be represented as a vector state. Explicitly, suppose that r decom- 
poses as r = J2ieN\l^i\'^\'fii) ("^il- Then define the Hilbert space JC := 0jgN^«' 
where each Id is a copy of /C. Now since (©jgpj K,i)®'H = '^^ '^^y 

define, for each X S B(]C®'H), the operator X e B{]C®'H) by diagonal action 
on the components of the sum, i.e. Xi^^^^^{ki hi)) := ®j_^^X{ki (g) hi). If 
we now define the vector (fi G K^hy (f = 0^ Pi^i, then we have for all X G Kl^H 
and X e "H: 

tr{U{\^){0\ ® \x){x\)U*X) = {®,^^{l3m ® X) I U*XU®^^^{f3,ip, ® x)>£^« 

= ® x) I ®^ej,u*xui(3,^, ® x))mH 

= E ® I U*XU{^, ® x))mn 

= Y,\Pi\hv{u{\^i){^i\(3\x){x\)u*x) 

ieti 



tr{U{T^\x){x\)U*X) 
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The second step is due to the diagonal action of the operators on K.<S>'H. The 
problem is now reduced to the vector-case by applying the above to x = t/), 
X = ■00 or X = -i/;! and on the other hand X = A, X = BorX = B^. □ 

The backbone of theorem 3.3 is formed by the special case that cto = ui = 0, 
[A, i?] = and T = which allows for a short and transparent proof. 

In order to arrive at a physical interpretation of theorem 3.3, we focus on 
the case B = B ^ 1, when information is transferred from Ti to /C. Indeed, 
examining /C at time t yields information about H at time 0. 

Quality of Information Transfer 

A small ratio ^i,"^^^^^ indicates a good quality of information transfer. The ratio 

equals in the perfect case, when cjo = cri = 0. Thus B<Sl takes a definite value 
of either bo or bi, depending on whether the initial state of H was lipo) or 
In this case, one can infer the initial state of H with certainty by inspecting 
only the ancilla /C. More generally, it is still possible to reliably determine from 
the ancilla /C whether the open system Ti was initially in state [(.i'o) or as 
long as the standard deviations are small compared to the difference in mean. 





Pib) 












\bi-bo\ J 










b 



Fig. 2: Probability densities p of S for input |V'o) and IV^i) 

As the ratio increases, the restriction (3.4) gets less severe, reaching triviality 
at (To + CTi = 2|6o - 6i|. 

Decoherence on the Commutcint of the Pointer 

Assume perfect information transfer, i.e. ctq = <ti = 0. If \A, B\ = 0, then theo- 
rem 3.3 says that coherent and mixed initial states yield identical distributions 
of A at time t. In order to distinguish, at time i, whether or not "H was in a 
pure state at time 0, we will have to use observables A which do not commute 
with B. But then A and B cannot be observed simultaneously. Summarizing: 

At time t, it is possible to distinguish whether % was in state ipQ 
or at time 0. It is also possible to distinguish whether T-L was 
in state tp or |aop|V'o)(V'o| + |Q!i|^|V'i)(V'i| o.^ time 0. But it is not 
possible to do both. 

We emphasize that this holds even when one has all observables of the entire 
closed system /C (g) "H at one's disposal. 
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Decoherence on the Open System 

We consider the final state of the open system %, obtained from the final state 
of the closed system /C (g) 'H by tracing out the degrees of freedom of the ancilla 
/C: an initial state p S S{H) yields final state tTx.{T{p)) € S{H). 

Suppose that information is transferred to a pointer B = B®l'm the ancilla 
/C with perfect quality that is ctq = (Ti = 0. Since [1 (g) A, i? (g) 1] = 0, we see 
from theorem 3.3 that tr(T(|V')(-0|)(l ® A)) must equal tr(T(|aoP|'0o)(V'o| + 
|Q!ip|V'i)(^i'i|)(l O A)) for all A e B{n), or equivalently 

trK(T(|V')(V'|)) =tr^(T(|aonV'o)(V'o| + |ainV'i>(V'i|)) • (3-5) 

In words: 

Suppose that at time t, by making a hypothetical measurement of B 
on the ancilla, it would be possible to distinguish perfectly whether 
the open system had been in state ipQ or ipi at time 0. Then, by 
looking only at the observables of the open system, it is not possible 
to distinguish whether H had been in the pure state tp = aoV'o+ctiV'i 
or the collapsed state \ao\^\tlJo){'4>o\ + o.t time 0. 

This statement holds true, regardless whether B is actually measured or not. 
(So we do not assume here that such a measurement is physically possible.) We 
have shown that the map Mq + Mi = trfc o T, with T : T {%) ^ T {K. ® U) 
the information-transfer operation defined by T{p) := U{t ® p)U* , constitutes 
a physical realization of the abstract decoherence mapping (Mq + Mi) of sec- 
tion 3.2. 

All in all, we have proven that decoherence is an unavoidable consequence 
of information transfer out of an open system. 

Example 

The simplest possible example of unitary information transfer is the following. 
Let /C ~ 'H ~ be the Hilbert space of a qubit; let Vo = (1, 0) and tpi = (0, 1) 
be the 'computational basis', and let : (g) (g) be the 'controUed- 

not gate'. Explicitly, U is defined by U\'tpi (g = \tpQ (g [/|V'o ® V^i) = 
|V'i®'0i)) f^lV'i^V'o) = l^i'i^V'o), and /7|V'o®V'o) = IV'o^V'o)- That is, it flips 
the first qubit whenever the second qubit is set to 1. Let r be the state of the 
first qubit. 

Since the initial state of the second qubit can be read off from the first, 
this situation satisfies the hypotheses of theorem 3.3 with B = 1 and 
ctq = (Ti = 0. We verify equation (3.5). For any state = aolV'o) + ailV'i)- 

C/lV'oOV') = Q;o|V'oOV'o) + Q!i|^iOV'i) := 
trK:(|^)(^|) = |ao|'|^^o)(^o| + |aiPlV'i)(V'i|- 

Thus wc have trx:(T(|V')(V'|)) = laoI^IV-'oXV'ol + biPlV'iXV'il- This agrees with 
equation (3.5), as one can easily check that tryc(T(|Q:oP|V'o)(V'o| + |o'iPlV'i)(V'i|)) 
equals |aoPl^o)(V'o| + laiPlV'iXV'il as well. 
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3.3.2 Unitairy Information Transfer and State Collapse 

We have derived that, in the context of information transfer to an ancillary 
system, the initial states |V')(V-'I and |aop|V'o)(V'o| + |Q!iP|V'i)(V'i| lead to the 
same final state. This is decoherence. 

State collapse is a much stronger statement: if outcome '0' is observed, then 
the system will further behave as if its initial state had been tpQ instead of ip. 
Similarly, if outcome '1' is observed, then the system will behave as if its initial 
state had been tpi. Now suppose that we ignore the outcome. Since '0' happens 
with probability |aoP and '1' with probability jaip, the system will behave as 
if its initial state had been |q;oP|V'o)(V'o| + |c«iPlV'i)(V'i|- We see that collapse 
implies decoherence. 

The converse does not hold however: imagine a Stern-Gerlach experiment, 
in which a beam of particles in a C7a,-eigenstate is split in two according to 
spin in the i;-direction. State collapse is the statement that one beam consists 
of particles with positive spin, the other of particles with negative spin and 
that both beams have equal intensity. Decoherence is the statement that both 
outgoing beams together consist for 50% of positive-spin particles and for 50% 
of negative-spin particles. The former statement is strictly stronger than the 
latter, and deserves separate investigation. 

We will therefore answer the following question: suppose that we transfer 
information to an ancilla K., and then separate IC from 'H, dividing 'H into 
subensembles according to outcome. What states do we use to describe these 
sub ensembles? 

Joint Probability Distributions 

A special case of an observable is an event p, which in quantum mechanics is 
represented by a projection P. The relative frequency of occurrence of p is given 
by ¥{p = 1) tr(pP). 

The projection 1 — P is interpreted as 'not p'. Furthermore, if a projection 
Q corresponding to an observable q commutes with P, then PQ is again a 
projection. According to quantum mechanics, p and q can then be observed 
simultaneously, and the projection PQ is interpreted as the event 'p and q are 
both observed'. 

A state p therefore induces a joint probability distribution on p and q: 

tr{pPQ)^¥{p^l,q = l) , P{p = 0,q = l)=tr{p{l-P)Q) 
tr(pP(l-Q))=P(p=l,g = 0) , P(p = 0,g = 0)=tr(p(l-F)(l-Q)) 

Particularly relevant is the case in which p is a state on a combined space JC®T-L, 
and the projections are of the form Q®\ and \®P. (The commuting projections 
are properties of different systems.) We then see that P(p = l^q — \) equals 
tY{{l® P){Q®l)p), which in turn is equal to tr (Ptr^(((3 ® l)/?))- This holds 
for all projections P on T-L, so that the normalized version of tY]c{{Q l)p) S 
T{%) must be interpreted as the state of T-L, given that q = 1- Similarly, the 
normalized version of trx:((l — Q) ® Ip) € T('H) is the state of %, given that 
g = is observed. 
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Collapse 

Let T : p ^ U{t (g) p)U* from TCH) to T(/C 'H) be an information transfer 
from "H to a pointer-projection Q € B(/C). That is, tr((0 (g) l)T(|-0o) (V^oD) = 
and tr(((5 (g l)T{\tpi){t{ji\)) = 1, so that at time t, one can see from fC whether 
H was in state Vo or ipi at time 0. 

Since Q (E) 1 commutes with all of 1 (g B{H), it is possible to separate H 
from fC, and divide H into subensembles according to the outcome of Q. This 
is done as follows: with any measurement on "H, a simultaneous measurement 
of Q on /C is performed to determine in which ensemble this particular system 
should fall. It follows from the above that the 1-ensemble should be described 
by the normalized version of Mi{p) := tr/ci{Q(El)T{p)), and the 0-ensemble by 
the normalized version of Mo{p) := tr^((l — Q (E) l)T(p)). Since Q commutes 
with B{T-l), this is just conditioning on a classical probability space at time t. 
We have arrived at an interpretation of the map M{p) := Mq{p) ® Mi(p) of 
section 3.2. 

We will now prove thatMtakestheformM(|V')(f/'|) = |aoptrK;T(|i/'o)(V'o|)© 
\ai\^tr)cT{\'tjji){'tjji\). This is a strong physical statement. For instance, any 
spin-system aolV'o) + ctilV'i) that is found to have spin 1 in the z-direction 
may subsequently be treated as if it had been in state at time 0. This is 
nontrivial: a priori, it is perfectly conceivable that the different initial states i/'o 
and ip result in different final states, even though they yield the same Q-output. 

One could alternatively, (and more traditionally), arrive at the 'collapse 
of the wavefunction' M(|V)(V|) = |ao|HrK:T(|V'o)(V'o|) 6 |ai|2trK;T(|Vi)(V'i|) 
by assuming that, at time 0, the quantum system makes either the jump 
I— >■ \ipo){''Po\ or the jump i— > Since we arrive at the 

same conclusion, namely the above 'collapse of the wavefunction', using only 
open systems, unitary transformations and the probabilistic interpretation of 
quantum mechanics, such an assumption of 'jumps' at time is made redun- 
dant. 

Proposition 3.4 Let T : p ^ U{t (g) p)U* from T{n) to T{IC (g) H) satisfy 
tr{{Q (g) l)T(|V'o)(V'o|)) = and tr((Q (g l)T(|V^i)(V'i|)) = 1 for some 'pomter- 
projeetion' Q on IC. Define a map M : T{n) ^ T{H) ® T{H) by M{p) := 
trK:((l-Q<g)l)T(p))®trjc(((5(gl)T(p)). Then for i; = aoipo + aii^i we have 
M(|V)(^|) = |ao|'tr;cT(|Vo)(V'o|)e |ai|Hr;cT(|^i)(^i|). 

This can be seen almost directly from proposition 3.1: 

Proof. Since Mi(|^/»o)(V'o|) > is a positive operator, we may conclude from 

tr (Ml (IV^o) (V'ol)) = that Mi(|'(/'o)('0o|) itself must vanish. Similarly, re- 
versing the role of and 1, we see that Mo{\ipi) = 0. From proposi- 
tion 3.1, we then obtain M(|V')(V'|) = |ao|2Mo(|V'o)(V'o|) 6 \ai\'^Mi{\%l){){%l)x\). 
The proof is completed by noting from tr^((l — Q (g l)T(|7/)i)(-!/'i|)) = that 
tr;c(r(|V'i>(V'i|)) = trK((Q(g l)T(|Vi>(^i|)) = Mi(|t/-i)(t/-i|), and similarly 
again reversing the role of and 1, that tr;<;(7'(|V'o)(V'o|)) = -^o(|V'o)(V'o|)- 1^ 
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We summarize: 

Consider an ensemble of systems of type H in state tp. Suppose that 

information is transferred to a pointer-projection Q on an ancillary 
system K.. Subsequently, the ensemble is divided into two subensem- 
bles according to outcome. Then all observations on H made after- 
wards, conditioned on the observation that the measurement outcome 
was 0, will be as if the system had originally been in the collapsed 
state ipo instead of tp. No independent 'collapse postulate' is needed 
to arrive at this conclusion. 

Example 

In the simple model of information transfer introduced in Section 3.3.1, we will 

now demonstrate why repeated spin-measurements yield identical outcomes. 

The probed system is once again a single spin 'H = C^, whereas the ancillary 
system now consists of two spins, JC = C^(8)C^ in initial state IV'o'^V'o)- Repeated 
information-transfer, first to pointer a^.i said then to 02,2, is then represented 
by the unitary U :— U2U1 on /C (g) "H. In this expression, Ui is the controlled 
not-gate flipping the first qubit of /C if is set to 1, and U2 fiips the second 
qubit of /C if "H is set to 1. 

Since U\tpo (81 V'o ® (aoV'o + aiV'i)) = |ao'0o ® ^/'o ® V'o) + \aiipi (g) V'l ® V'l)) 
we can explicitly calculate the joint probability distribution on the two pointers 
az,i and az,2 in the final state: 



In particular, wc sec that if the first outcome is 1 (which happens with proba- 
bility Iccip), then so is the second, proposition 3.4 shows that this is the general 
situation, independent of the (rather simplistic) details of this particular model. 

3.3.3 Information Leakage to the Environment 

On closed systems decoherence does not occur, because unitary time evolution 
preserves the purity of states. However, macroscopic systems are almost never 
closed. 

Imagine, for example, that H ~ represents a two-level atom, and K. 
some large measuring device. Information about the energy 1 (g) cr^ of the atom 
is transferred to the apparatus, where it is stored as the position B (g) 1 of 
a pointer. Then as soon as information on the pointer-position B ® 1 leaves 
the system, collapse on the combined atom-apparatus system takes place. For 
example, a ray of light may reflect on the pointer, revealing its position to the 
outside world. (See [JZ85].) It is of course immaterial whether or not someone 
is actually looking at the photons. If even the smallest speck of light were to 
fall on the pointer, the information about the pointer position would already be 
encoded in the light, causing full collapse on the atom-apparatus system. (See 
[Zur82] for an example.) 



P(Sz,l= l,Sz,2 = l) = |aip 

P(S^,1 = -1,S,,2 = 1)= 



=P(s,,i= 1,S^,2 = -1) 

ao|'=P(s^,i = -l,Sz,2 = -l) 
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The quality of this information transfer will not be perfect. If a macroscopic 
system is interacting normally with the outside world, (the occasional photon 
happens to scatter on it, for instance), then a number of macroscopic observables 
X will leak information continually, with a macroscopic uncertainty a. This 
enables us to apply theorem 3.3. It says that all coherences between eigenstates 
tpxi and 'tl'x2 of macroscopic observables X are continually vanishing on the 
macroscopic system £, provided that their eigenvalues xi and X2 satisfy the 
inequahty \xi — X2 \ ^ 2a. (The pointer, e.g. a beam of light, is outside the 
system, so that 6 — 0.) 

Take for example a collection of N spins, with Hilbert space £ = (^^^C^. 
Suppose that the average spin-observables Sa = jf X^iLi "'a (for = x, y, z) are 
continually being measured with an accuracy'^ N~^^'^ <C u <C 1. Then between 
macroscopically different eigenstates of Sa, i.e. states for which the eigenvalues 
satisfy \sa — -Sal ^ o", coherences are constantly disappearing. However, the 
information leakage need not have any effect on states which only differ on a 
microscopic scale. Take for instance p |+)(+| and p (g) |— )(— |, with p an 
arbitrary state on — 1 spins. Indeed, — < 2/N <^ a, so theorem 3.3 is 
vacuous in this case: no decoherence occurs. 

We see how the variance produces a smooth boundary between the macro- 
scopic and the microscopic world: macroscopically distinguishable states (in- 
volving S'tj-differences ^ a) continually suffer from loss of coherence, while 
states that only differ microscopically (involving S'a-differences <C a) are unaf- 
fected. 

In case of a system monitored by a macroscopic measurement apparatus, 
we are interested in coherence between eigenstates of the macroscopic pointer. 

By definition, these eigenstates are macroscopically distinguishable. We may 
then give the following answer to the question why it is so hard, in practice, to 
witness coherence: 

// information leaks from the pointer into the outside world, deco- 
herence takes place on the combination of system and measurement 
apparatus. In practice, macroscopic pointers constantly leak infor- 
mation. 

3.4 Closed Systems 

Closed systems evolve according to unitary time evolution, so that coherence 
which is present initially will still be there at later times. Yet on macroscopic 
systems, coherent superpositions are almost never observed. Why is this so? 

3.4.1 Macroscopic Systems 

Because of the direct link that it provides between the scale of a system on the 
one hand, and on the other hand the difficulties in witnessing coherence, we 

^Since [Sx, Sy\ ^ 0, they cannot be simultaneously measured with complete accuracy, see 
e.g.[Wer01]. However, this problem disappears if the accuracy satisfies > \\\{Sx:Sy\\\ = 

, see [Jan04]. For large N, (typically W ~ 6 x 10^^), this allows for extremely accurate 
measurement. 
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feel that the following line of reasoning, essentially due to Hepp [Hep72], is the 

most important mechanism hiding coherence. 

Let us first define what we mean by macroscopic and microscopic observ- 
ables. We consider a system consisting of A'' distinct subsystems, i.e. /C = 
^iLi ^i- If thinks of JCi as the atoms out of which a macroscopic system 
K. is constructed, TV may well be in the order of 10^"^. 

We will define the microscopic observables to be the ones that refer only to 
one particular subsystem /Cj: 

Definition 3.1 An observable X € B{IC) is called microscopic if it is of the 
form X = l(g)...(g)l(g)Xj(g)l(g)...(8)l for some i £ {1,2, ... , N} and some 



In this situation we will identify Xi € B{K,i) with X G B{JC). We take macro- 
scopic observables to be averages of microscopic observables 'of the same size': 

Definition 3.2 An observable Y G B{JC) is called macroscopic if it is of the 

form Y = J2f=i with Y, G B(/C,) such that \\Yi\\ < \\Y\\. 

We will only use the term 'macroscopic' in this narrow sense from here on, even 
though there do exist observables which are called 'macroscopic' in daily life, 
but do not fall under the above definition. 

Now suppose that we transfer information from a system "H to a macroscopic 
system /C = /Cj, using a macroscopic pointer B G B{K). As explained 

before, we then have a map T : T('H) T(/C ® Ti) such that the pointer 
B ®1 has different expectation values 60 and 61 in the states T(|^/'o)(V'o|) and 



and r(|^/'i)(^/'i[) to be eigenstates of B. Instead, we will require their standard 
deviations in B to be negligible compared to their difference in mean, that is 
CTo < l&o - &1I and ai < |&o - 

After this information transfer, we try to distinguish whether the system 'H 
had initially been in the coherent state aolV'o) + ct\\ip\) or in the incoherent 
mixture |q!oP|V-'o)(V-'o| + |cnPl'/'i)('/'i|- Wc have already shown that this cannot 
be done by measuring observables in 1® B((H). The following adaptation of 
theorem 3.3 shows that it is also impossible to do this by measuring macroscopic 
or microscopic observables on the closed system K®'K. 

Corollary 3.5 Let tpo and ipi be orthogonal unit vectors in a Hilbert space H 

and let r G 7~(/C) be a state on the Hilbert space JC = 0^1^ JCi. Let U : JC(g)n ^• 
K^fL be unitary and define T : T{H) -)■ T{IC O H) by T{pl = U{t ® p)U*. 
Let B be a macroscopic observable in B{K), and define B := B ®\. Denote by 
bj and a'j its expected value and variance in the state T{\xpj){ipj\) for j = 0, 1. 
Suppose that bo ^ &i. Then for all ip = aoV'o + (Xii^i with |aop + jaip = 1 and 
for all microscopic and macroscopic observables A G B{)C0 7i), we have 



Xi G B(/Ci). 




tr(T(|V;)(V|)^)-tr(T(|aonV'o)(V'o|+|aillV'i)(V'i|)^) < 



j^\\B\\+ao+a, 



Pll- 



60 - bi 



62 



Chapter 3 



Proof. If A is microscopic, we have the inequality 



\[AB] 



N 



m\m\ 

N 



If A is macroscopic, we have 

JV JV 



JV JV JV 

[^'^]|| = II [iv^T E^- ^ E^^] II = NW+1) E II [^-^^111 ^ 



i=0 j=l 

Either way, we can now apply theorem 3.3. 



i=l 



B\\ 



N 



□ 



3.4.2 Examples 

In order to illustrate the above, we discuss four examples of information transfer 
to a macroscopic system. 



The Finite Spin-Chain 

We study a single spin "H = in interaction with a large but finite spin-chain 
/C = C^, the latter acting as a measurement apparatus. Once again, let 

ipo = (1,0) and ipi = (0, 1) be the 'computational basis'. Initially, all spins in the 
spin-chain are down: t = |-0o ® • • • ® V-'o) (V-'o O • • • (8) V'o I • Let Ui : JC(E>'H ^ JC(E)H 
be the 'controUed-not gate', which flips spin number i in the chain whenever 
the single qubit is set to 1. (We define Uj = 1 for j ^ {1,2,..., N}.) 



C/j = 1 (8> P_ -h a^^i (g) P+ with P+ = 



1 




p_ = 




1 



In discrete time n S Z, the unitary cvohition is given by n i-^ UnUn-i ■ ■ ■ U2U1. 
(See [Hep 72].) This represents a single spin fiying over a spin-chain from 1 to 
A'', interacting with spin n at time n. 

Obviously, we have Un\'4'o ® ■■ - ^ V'o) ® IV'o) = IV'o ® . . . ® ipa) ® |V'o) and 
Un\'^o ® ■ ■ ■ ® V'o) ® IV-"!) ~ \'^Ji ® ■ ■ ■ ® ipi) (X) We consider the average 

spin of the spin-chain as pointer, B = X]il=i '^z,i- This makes the map T : 
p Unt pU^ an information transfer to a macroscopic system. Applying 
Corollary 3.5 with 60 = —1, b\ = 1 and <jq = ai = 0, we see that 

tr(T(|aoVo + ai^i'i)(ao^o + aiVi|)^)-tr(T(|ao|^|V'o)(V'o| + |a?||^i'i)(^i|)A) 

is bounded by || A|| for all microscopic and macroscopic A E B{IC(E)H)- It is not 
hard to see that in this particular model, the estimated quantity is even identi- 
cally zero. Indeed, (■i/'o O • • • V'O | V'l <H) • • • V'O = (V'O | V'i)^"MV'o | ^iV-i) = 
for all microscopic Xi. 

Of course coherence can be detected on the closed system K. (E)'H, but only 
using observables that are neither macroscopic nor microscopic, such as the 
'product of all spins' ■ ■ ■ cr^- 



63 



Finite Spin-Chain at Nonzero Temperature 

A more realistic initial state for the spin-chain is the nonzero-temperature state 

Tl3 = tre-fiH ' ^'^^ spin-chain Hamiltonian wc will take H = az,i = NB, 
so that Tjs becomes the tensor product of N copies of the C^-state 

With the same time evolution as before, we have Tdi/ig) ("^oD — IV'o)(^ol ® Pp 
and T{\tpi){tpi\) = \tpi){tpi\ (g) Again we choose the mean energy B as 

our pointer. A brief calculation shows that tr(BT^) = =■ and that 

tr(B^T^)— tr(Bp^)^ = ^(1— £^(^)). Corollary 3.5 now gives us, for microscopic 
and macroscopic A, 

tr(T(|aoVo + ai^i)(aoV'o + aiVi|)^)-tr(T(|ao|Vo)(V'o| + |a?||V'i)(^i|)^) 

For large N, we see that the term ~ ^ due to the fact that [A,B] 7^ is 
dominated by the thermodynamical fluctuations, which of course go as 
In statistical physics, it is standard practice to neglect even the latter. 

Energy as a Pointer 

Hamiltonians often fail to be macroscopic in our narrow sense of the word, since 
they arc gcucrically unbounded and contain interaction terms. However, this 
does not imply failure of our scheme to estimate coherence. 

For example, consider an A^-particle system with Hilbert space fC = fCi 

and Hamiltonian H = J^iLi + V{xi,X2, ■ ■ ■ ,Xn). Information is trans- 
ferred from H to K, with H as pointer, that is the two states tr-u{T{\tpQ) (i/jqI)) 
and tr-fi{T{\tpi){tpi\)) have different energies E and E' . Without loss of gen- 
erality, assume that they are vector states: tr-u{T{\tpo){'4>o\)) = \i'){''P\ and 
tr^(T(|-0i) (-01 1)) = \'ip'){tjj'\. (Density matrices can always be represented as 
vectors on a different Hilbert space, cf. the proof of theorem 3.3.) 

We thus have two vector states \tp) and l'^') with two different energies, 
E :— Htp) and E' := {tp' \ Hip'). We estimate the coherence between |^) 
and l'^') on a;„, the position of particle n. 

{E-E'){i,\Xn1p') = {E^l,\Xni^')-{Xn-4^\E'i,') 

= (ij^ _ (ij _ E)iP I x^i,') - (x„V I H^Ij' -{H- E')iP') 
= {[H, xn]i^ I V') - {{H - E)i; I Xni^') + (x„V I {H - E')^;') 

Now since [i?, .x„] = 2^\Pn^-^n] = "^^^^ , we can apply the Cauchy-Schwarz 
inequality in each term to obtain 
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If we define the characteristic speed Vn ■= \/ (V' I (Pn/'7i„)^V'); the character- 
istic positions X„ := sj (t/i | a;^V) and X'^ := ^ \ and the standard 
deviations a := v^(i/' | {H - Efi:) and cr' := ^(V-' | (i? - i^')V>. we obtain 

KV'knV' )l < - 



|£;-£;'| 

As such, this doesn't tell us very much. We will have to make some physically 
plausible assumptions on the state of the system in order to obtain results. First, 
wc assume that the system is encased in an L x L x L box so that X„, X'^ < L. 

Also, we assume Vn < c. This yields K'f/' | a;„V')l < ^"^e^e']'^ ■ Secondly, we 
assume that scaling the system in any meaningful way will produce \E—E'\ ~ N 
and a+a' <^ VN, so that the coherence on a;„ approaches zero as ~ Notice 
the almost thermodynamic lack of detail required for this estimate. 



Schrodinger's Cat 

Let us finally analyze the rather drastic extraction of information from a ra- 
dioactive particle that has become known^ as 'Schrodinger's cat'. (See [Sch35].) 
The experiment is performed as follows. We are interested in a radioactive 
particle. Is it in a decayed state ipo or in a non-decayed state ijji? 

In order to determine this, we set up the following experiment. A Geiger 
counter is placed next to the radioactive particle. If the particle decays, then 
the Geiger counter clicks. A mechanism then releases a hammer, which smashes 
a vial of hydrocyanic acid, killing a cat. All of this happens in a closed box no 
higher than Im, and completely impenetrable to information. A measurement 
of the atom is done as follows: first, place it inside the box. Then wait for a 
period of time that is long compared to the decay time of the atom. Finally, 
open the box, and inspect whether the cat has dropped dead or is still standing 
upright. 

The atom is described by a Hilbert space "H, the combination of Geiger 

counter, mechanism, hammer, vial and cat by a Hilbert space K.. Initially, the 
latter is prepared in a state \0). As a pointer, we take the center of mass of the 
cat, Z := J2i=i ^i- III this expression. A'' is the amount of atoms out of which 
the cat is constructed, and Zi is the z-componcnt of particle number i. (It is a 
harmless assumption that all atoms in the cat have the same mass.) Since the 
box only measures Im in height, we may take \\Z\\ — 1. The unitary evolution 

^Actually, Schrodinger's proposal was slightly different. In the original thought experiment, 
death of the cat was correlated with decay of the atom at time t instead of 0, which wouldn't 
make it an information transfer in our sense of the word. 
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U e B(/C O H) then produces U\tpo O 6) := [70) and [/'IV'i O 61) := I71), which 
are eigenstates^ of Z with different eigenvalues. 

Suppose that, initially, the atom is either in the decayed state ipo with prob- 
ability |aoP or in the non-decayed state tpi with probability |aip. That is, the 
initial state is the incoherent mixture |aoPl''/'o)('0ol + |Q^iPl^i)(^i|- By linearity, 
the final state is then the incoherent state |q;oP|7o)(7oI + |q^iP|7i)(7i|- 

On the other hand, if the atom starts out in the coherent superposition 
oioli^o) + then the combined system ends up in the coherent state 

U\{aoi>o + aiV'i) '^0) = ao|7o) + ai|7i)- 

The question is now this: why do we not notice the difference between these 
two situations if we open the box? First of all, according to theorem 3.3 (and the 
observations following it in section 3.3.1), it is impossible to detect coherence 
between 70 and 71 and ascertain the position of the cat. Upon opening the 
black box, we must make a choice. 

Secondly, according to the discussion in section 3.3.3, the coherences between 
the macroscopically different states 70 and 71 are extremely volatile. Any speck 
of light falling on the cat will reveal its position with reasonable accuracy, caus- 
ing the coherence to disappear according to theorem 3.3. 

Yet even if wc were able to open the box without any information on the 
position of the cat leaking out, even then would we be unabk^ to d(^t(K't coherence 
between 70 and 71 . Apply Corollary 3.5 to the transfer of information from atom 
to cat. We have ctq = ui = 0, and with pointer Z we have ||Z|| = 1 (the height 
of the box is 1 m) and zi — zq =0.1 (the difference between a cat that is 
standing up and one that has dropped dead is 10 cm). We then obtain for all 
macroscopic and microscopic A: 

|(ao7o + Q:i7i | Aao7o + Q!i7i) - (|ao|^(7o | ^7o) + laifili \ ^7i))| < ■ 

On the subset of observables we are normally able to measure, the distinction 
between coherent and incoherent mixtures practically vanishes for N ^ 10^'^. 
For all practical intents and purposes, it is completely harmless to assume that 
the final state of the cat is |Q;oP|7o)(7o| + |ain7i)(7i| instead of ao|7o) + ai|7i)- 
But it would be false to state that the former has actually been observed. 

3.5 Conclusion 

In open systems, we have proven that decoherence is a necessary conseriueiice 
of information transfer to the outside. More in detail, we have reached the 
following conclusions: 

- Suppose that an open system H interacts with an ancillary system JC in 
such a way, that it is possible, in principle, to determine from K, whether 
H had been in state iJjq or ipi before the interaction. If H started out in a 
coherent state aol^o) + ctilV'i)) then it will behave after the information 

^As discussed before, it would be more realistic to allow for a nonzero variance < aj <^ I 
instead of requiring 8j to be eigenstates of Z. We use aj = for clarity, leaving the argument 

essentially unchanged . 
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transfer as if it had started out in the incoherent mixture |q!o|^|V'o)(V'o| + 
|q:iPIV'i)(V'i| instead. This is called 'decoherence'. 

- Suppose again that the information whether % was in state V-'o or V-'i is 
transported to an ancillary system K.. This is done with an ensemble of T-L- 
systems described by the state adV'o) +Q!i|V'i)- The ensemble is then split 
into subensembles. according to outcome. The '0-ensemble' then behaves 
as if it had been in state f/'o at the beginning of the procedure, and the 
'1-ensemble' as if it had started in state V'l- This is called 'state collapse'. 

- These results were obtained entirely within the framework of traditional 
quantum mechanics and unitary time cvohitioii on a larger, closed system 
containing 'H. No 'reduction-postulate' is needed. From proposition 3.1, 
we see that any information extraction causes collapse, quite independent 
of its particular mechanism. 

- On the closed system containing the smaller, open one no decoherence 
occurs in principle. In practice however, closed systems are very hard to 
achieve. We have argued that information transfer from a macroscopic ob- 
servable A, performed with macroscopic precision cr, causes decoherence 
between eigenstates of A if their values satisfy cr <C |ai — ao|. Since infor- 
mation on macroscopic observables tends to leak out, coherence between 
macroscopically different states tends to vanish. 

Still, even if the combined system /C07^ is considered perfectly closed, there are 
some results to be obtained. Again, we investigated the case that a system % 
interacts unitarily with a system /C in such a way that the information whether 
% was in state '^o or can be read off from a pointer in K. We have reached 
the following conclusions concerning the closed system /C 'H: 

- Using only observables on the closed system that commute with the pointer, 
it is impossible to detect whether % had started out in state aolV'o) + 
ai\tpi) or |aoPlV'o)(V'o| + l«iPlV'i)(V'i|- Physically, this means that it is 
impossible to distinguish between coherent and incoherent initial states 
while at the same time distinguishing between ipQ and tpi. 

- Suppose that the closed system K ® % macroscopic, and that one 
has access to its macroscopic and microscopic observables only. Then 
it is almost impossible to distinguish whether T-L had started out in state 
QiolV'o) or |q:oP|V'o)(^o| + |aiPlV'i)(V'i|- We have obtained upper 
bounds on the coherences (V'o I Aipi), evaluated on microscopic or macro- 
scopic A. Assuming perfect information transfer (o"o = ci = 0), they 
approach zero as ~ ^, where N is the size of the system. 

In short: no decoherence ever occurs on perfectly closed systems, even if they 
are macroscopic. It is just very hard to distinguish coherent from incoherent 
states, creating the false impression that it does. 
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The link between decoherence and macroscopic systems was brought forward 
by Klaus Hepp in his fundamental paper [Hep72], where he considered infinite 
closed systems, displaying decoherence in infinite time. In infinite systems, 
the microscopic observables form a non-commutative C*-algebra A. Its weak 
closure A" is considered as the (von Neumann-) algebra of all observables. The 
macroscopic observables form a commutative algebra C which is contained in the 
centre of A", i.e. C C Z = {Z e A"\[Z. A] = \/A e A"}., yet is almost disjoint 
from the microscopic observables: C H A = CI. Transfer of information to a 
macroscopic observable therefore implies perfect decoherence on all microscopic 
and macroscopic observables (cf. section 3.3.1). 

Unfortunately, this transfer cannot be done by any automorphic tinu^ (n'olu- 
tion, since the macroscopic observables are central. Hepp proposed information 
transfer by a t — >■ oo limit of automorphisms. He was able to show that this 
causes decoherence in the weak-operator sense. That is. on each fixed micro- 
scopic observable, the coherence becomes arbitrarily small for sufficiently large 
time t. 

The paper was criticized by John Bell a few years later [Bel75], on the 
grounds that, for each fixed time t, there are observables to be found on which 
coherence is not small. Since Bell was of the opinion that a 'wave packet reduc- 
tion', even on closed systems, 'takes over from the Schrodinger equation', this 
was not to his satisfaction. He did agree however that these observables would 
become arbitrarily difficult to observe in practice for large t. 

By considering large but finite closed systems subject to unitary time evo- 
lution, we hope to clarify the role that macroscopic systems play in making us 
mistake coherent superpositions for classical mixtures. It seems striking that 
the same, simple mathematics can also be used to understand why open systems 
do undergo decoherence as soon as they lose information. 
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Chapter 4 

Unifying Decoherence and the 
Heisenberg Principle 

The laws of quantum mechanics impose severe restrictions on the quahty of 
measurement. In this chapter, we investigate some of these restrictions from a 

quantum probabilistic point of view. We introduce state independent figures of 
merit for measurement performance, and then use them to prove sharp bounds. 

4.1 Introduction 

In quantum probabihty theory, an observable is modelled by a self-adjoint op- 
erator ^ in a von Neumann algebra A, and a state by a positive, normalized, 
linear map p : A ^ C As discussed in chapter 2, a (normal) state induces 
a probability measure on the spectrum of A. It is the objective of a quantum 
measurement to portray this probability measure as faithfully as possible. 
According to the uncertainty relation of Heisenberg, Kennard and Robertson 

<7xC7y>i|p([X,y])|, 

(sec [Hei27, Ken27, Rob29]); there is an inherent variance in the quantum state. 
Furthermore, quantum theory puts severe restrictions on the performance of 
measurement. These restrictions, which come on top of the measurement restric- 
tions implied by the above uncertainty relation, fall into four distinct classes. 

I The impossibility of perfect joint measurement. It is not possible to per- 
form a simultaneous measurement of two noncommuting observables in 
such a way that both measurements have perfect quality. 

II The Heisenberg principle, (see [IIei27]). This states that quantum infor- 
mation cannot be extracted from a system without disturbing that system. 

Ill The impossibility of classical coding. One cannot extract classical informa- 
tion from a quantum system and then use this information to reconstruct 
the quantum state. 
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IV The collapse of the wave function. When information is extracted from 
a quantum system, a so-called decoherence is experimentally known to 

occur on this system. 

We aim to find a quantitative description for each of these fundamental mea- 
surement restrictions. 

The subject of uncertainty relations in quantum measurement is aheady 
endowed with an extensive literature. For example, the Heisenberg principle and 
the impossibility of joint measurement are quantitatively illustrated in [AK65, 
Oza03, Ish91, Hal04]. However, the inequalities in these papers depend on the 
state p, which somewhat limits their practical use. Indeed, the bound on the 
measurement quality can only be calculated if the state p is known, in which 
case there is no need for a measurement in the first place. 

In section 4.2, we introduce two state independent figures of merit: the 
maximal added variance and the measurement infidelity. These will lead us quite 
naturally to state independent bounds on the performance of measurement. 

In section 4.3, we introduce our main tool in the proof of these inequalities: 
on operator valued Cauchy-Schwarz inequality. 

In section 4.4, we prove a sharp, state independent bound on the performance 
of jointly unbiased measurement. This generalizes the impossibility of perfect 
joint measurement. 

In section 4.5, we prove a sharp, state independent bound on the performance 
of a measurement in terms of the maximal disturbance that it causes. This 
generalizes the Heisenberg principle. 

In section 4.6, using a result of Keyl and Werner, we prove a sharp, state 
independent bound on the performance of classical coding. 

In section 4.7, we investigate the collapse of the wave function. In con- 
trast with the Heisenberg principle and its abundance of inequalities, this phe- 
nomenon has mainly been investigated in specific examples (see e.g. [Hep72, 
Zur82, JZ85]). Although there are some bounds on the remaining coherence in 
terms of the measurement quality (for example [Sew05], but also theorem 3.3 
in the previous chapter), a sharp, information-theoretic inequality does not yet 
appear to exist. We will provide such an inequality, and prove a sharp upper 
bound on the amount of coherence that can survive information transfer. 

First of all, this generalizes the collapse of the wave function, and shows that 
decoherence is a mathematical consequence of information extraction. But it 
also shows that no information can be extracted if all coherence is left perfectly 
intact, which is precisely the Heisenberg principle. It is therefore a unified de- 
scription of both the Heisenberg principle and the collapse of the wave function. 

4.2 Information Transfer 

We wish to model information transfer between quantum systems. Recall from 
chapter 2 that a quantum system is modelled by a quantum probability space, a 
pair {A, p) of a von Neumann- algebra A and a normal state p G S{A). Infor- 
mation transfer then proceeds as follows. We are interested in the distribution 
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of the observable A e A, with the system A in some unknown state p. We 
perform the operation T* : S{A) ^{B), and then observe the 'pointer' B in 
the system B in order to obtain information on A. 

4.2.1 General Information Transfer 

We take the point of view (see [Hal04]) that any CP-map T : S ^ ^ is an infor- 
mation transfer from any observable ASiAto any pointer B G B. Undeniably 
though, for the purpose of transferring information, some operations are better 
than others. What does it mean for a CP-map T to be 'good' at transferring 
information? 

Perfect Information Trainsfer 

The initial state p defines a probability measure Pj on the spectrum of A by 

Pi(S') := p{ls{A)), where 15(^4) denotes the spectral projection of A associated 
to the set S. Similarly, the final state T* p defines a probability distribution P/ 
on the spectrum of B. 

The object of information transfer is to gain information on P^, the prob- 
ability measure of the observable A before the operation, by looking at the 
probability measure P/ of the pointer B afterwards. We will therefore con- 
sider an information transfer to be perfect if Pj = P/ for any initial state p. 
Translated to algebra, this reads as follows. 

Definition 4.1 An information transfer T : B ^ A from A G A to B G 
B is called perfect if T{B) = A and if the restriction of T to B", the von 
Neumann algebra generated by B, is an isomorphism B" — )• A" of commutative 

nan Neumann algebras. 

Indeed, P, = P/ if and only if Ej(g) = Ey(g) for any measurable function g. 
In other words, T* p{g{B)) = p{g{A)) for all p € S{A) and g e L°°{M). But if 

p{T{g{B))) equals p{g{A)) for any normal state p, wc must have T{g{B)) = g{A) 
and vice versa. Since X" = {g{X)\g e L°°(M)}, the two notions of perfection 
coincide. 

Measurement Infidelity 

Not unlike a man, an information transfer which is interesting is not necessarily 
perfect. The aim of information transfer is to infer Pj from ¥f, so surely T is 
good if Pj and Pj are close. 

Recall that both Pj and P/ depend on the state p. If we choose a metric d on 
the space of probability measures, then we can define the measurement infidelity 
S to be the worst case distance between the input and the output measure, 

5:= sup d(Pi,P/). 

Wc consider T to be a good information transfer from A to B if 5 is small. 

Clearly, T is perfect if and only ii S — 0. 
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As a metric on the space of probability measures, we employ the trace dis- 
tance (a.k.a. variational distance or Kolmogorov distance) 

d(P/,Pi) :=sup{|Pi(5)-P/(5)|}. 

5CR 

It is the difference between the probabihty that the event S occurs in the dis- 
tribution Pi and the probability that it occurs in the distribution Py^, for the 
worst case Borel set S* C M. 

Let us expand this definition, and write it in a more algebraic fashion. 
We have Pi(S') = p{ls{A)) and Vf{S) = p{T{ls{B))), so that suppd(P,,P/) 
can be written sup^^^ \p{ls{A) - T{ls{B)))\. As sup^ \p{X)\ = \\X\\, this is 
just supg ||ls(A) — T(ls(i?))||, and we arrive at the following less intuitive yet 
cleaner expression. 

Definition 4.2 Let T : B ^ A be a CP-map. Its measurement infidelity 6 in 

transferring information from the observable A to the pointer B is defined as 
S := swpg — r(l5(_B))||, where S runs over the Borel subsets ofM.. 

The following is a standard example of a perfect information transfer. 
Example: von Neumann Qubit Measurement 

Let A = M2(C) and A = Uz- Let = {+1, —1} be the spectrum of Uz- Denote 
by L°° (f2) the (commutative von Neumann) algebra of C- valued random vari- 
ables on n. (This is just with pointwise multiplication.) A state L°°(f2) C 
is precisely the expectation E with respect to some probability measure P on 
J7. Define the probability measures P± to assign probability 1 to ±1, so that 
E±(/) = /(±1)- We take B = M2(C) (g) L°°{n) and B = 1 ® {S+i - S-i). 

The von Neumann- measurement T : M2(C) (E) L°°{n) — > M2(C) is then 
defined as T(X ® /) := f{+l)P+XP+ + /(-1)P_XP_, with P+ = \ t)(t I 
and P_ = I |. The dual T* : 5(M2(C)) ^ 5(M2(C)) 5(L°°(f])) reads 
T*p = p{P+)\\){t\®-P+ + piP-)\i){i\^¥- 

In words: with probabihty p{P+), the output 4-1 occurs and the qubit is 
left in state |t). With probability p{P^), the output —1 occurs, leaving the 
qubit in state The von Neumann-measurement T is a perfect information 
transfer from cr^ G M2(C) to 1 ® {d+i - S-i) € M2(C) ® L°°{n). 

POVMs 

Quantum measurements are often (e.g. [Hol82, Hal04]) modelled by Positive 
Operator Valued Measures or POVMs. From a CP-map B ^ A and a pointer 
B e B, we may distill the POVM p : Z;(Spec(P)) A hy way of p{S) := 
T{lg{B)). It will only be projection valued if the information transfer is perfect. 

For instance, the above von Neumann qubit measurement gives rise to the 
POVM p:Vt^ M2 defined by p{+l) = P+ and /i(-l) = P_. 

Note that from the POVM, the CP-map can be reconstructed only on B" . 
A CP-map can thus be seen as an extension of a POVM that keeps track of the 
system output as well as the measurement output. Since we will be interested 
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in disturbance of the system, it is imperative that we consider the full CP-map 
rather than merely its POVM. 

4.2.2 Unbiased Information Transfer 

Not unlike a man, it does an information transfer credit to be unbiased. This 
means that the expectation of the observable A before the operation is equal 
to the expectation of the pointer B afterwards, for any initial state p. In other 
words, if T*p{B) = p{A) for any p. This is equivalent to p{T(B) - A) = for 
any p, and thus to T{B) = A. 

Definition 4.3 A CP-map T : B ^ A is called an unbiased information trans- 
fer from AeAto BeB ifT{B) = A. 

Any perfect information transfer is unbiased, but the converse does not hold. 
Schematically, we have 




Fig. 3: Kinds of information transfer. 



MELximal Added Variance 

For unbiased information transfer, there is a natural figure of merit other than 

the measurement infidelity. The variance of an observable X is defined as 
Yar{X,p) = Ep(X2) - Ep{Xf, or equivalently p{X^) - p{Xf. An unbiased 
information transfer is good if the variance Var(S, T*{p)) of the pointer in the 
final state is as low as possible. 

The output variance can be split in two parts. One part Var(y4., p) is the 
variance of the input, which is intrinsic to the quantum state p. The other part 
Yai{B ,T* [p)) — Yai{A,p) > is added by the measurement procedure. 

It would be unfair to hold the information transfer responsible for uncer- 
tainty that was already there to begin with, so we will take this added variance 
to measure its performance. The maximal added variance, 

:= sup Var(S, T*{p)) - Var(A, p) , 

will be our figure of (de) merit. 

For example, in the case of perfect information transfer from A to B, we 
have that Var(B,r*(p)) = Var(A, p); the maximal added variance is 0. There 
is uncertainty in the measurement outcome, but all uncertainty 'conies from' 
the quantum state, and none is added by the measurement procedure. 
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To obtain a cleaner definition, wc expand Var(i?, T*(p)) — Var(A, p) into 
{p(T{B^)) - p{T{B)f) - (p{A^) - p(Af). Using the fact that T{B) = A, we 
obtain E2 ^ suppp(r(B2) -r(B)2), and thus = \\T{B'^) - T{Bf\\. 

Definition 4.4 The maximal added variance of an unbiased information trans- 
fer T : B ^ A from an observable A = T{B) to a pointer B is defined as 

:=\\T{B'^)-T{Bf\\. (4.1) 

4.3 An Operator Valued Form 

Inspired by formula (4.1). we define an operator valued form which allows us to 
consider S as the 'length' of the pointer: S = yJ\\{B, B)\\. 

Definition 4.5 Let T : B ^ A be a CP-map. We define the operator-valued 
sesquilinear form {■ , ■)t '■ B x B ^ A by 

{X, Y)t := r(Xl'y) - T{X)'iT{Y) . (4.2) 

If there is no danger of confusion, we will drop the subscript T. 

As we will see, the form {■ , ■)t is sesquilinear and positive semidefinite. 
Its most useful feature however is that it satisfies an operator valued Cauchy- 
Schwarz inequality. 

4.3.1 A Cauchy-Schwarz Inequality 

Before proving the operator valued Cauchy-Schwarz inequality, let us first check 
that ( • , • )t is indeed sesquilinear and positive semidefinite. 

Proposition 4.1 The form ( • , ■)t from definition 4-5 is linear in the second 
argument, it satisfies {X,Y)^ = {Y,X), and {X,X) > for all X G B. 

Proof. Linearity in the second argument follows from linearity of T, and the 
fact that (X,F)t = {Y,X) is a consequence of T{X^) = T{X)^. The following 
proof of positivity is taken from [Maa04]. 

As T is completely positive, T (g) id : S ® M2(C) ^ (g) M2(C) is positive, 
so that 

is a positive matrix. Using the fact that A>0 implies B^AB > 0, we find that 

/I OV fT(X^X) T(Xt)\ /I 0\ 
[-TiX) Oj [ T{X) 1 ) \-T{X) Oj 

is also positive, and hence T{X'<X) — T{XyT{X) > as required. □ 
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We are now ready to prove the operator valued Cauchy-Schwarz inequality. 

Simple as it may be, it will be crucial to the remainder of this chapter. All our 
bounds on the performance of information transfer will depend upon it. 

Lemma 4.2 (Cauchy-Schwarz) Let T : B ^ A be a CP-map, and define 
{X, Y) := T(XtF) - T{X)^T{Y). Then for all X,Y G B: 

{X,Y){Y,X)<\\{Y,YMX,X). (4.3) 

Proof. MT : B ^ Ais completely positive\ then so is T (g) id : S (g) M2(C) -> 
A® M2(C). According to proposition 4.1, 



X Y\ fx Y\\ _ ({X,X)t {X,Y) 



oyvo o;;^^.^ \{y,x)T {y,y)t 



T 



is a positive matrix, so that also 



1 Oy ({X,X)t {X,Y)t\ ( 1 0- 
-{Y,X)t Oi \{Y,X)t {Y,Y)t \-{Y,X)t 



(4.4) 



is positive, and therefore {X,X) - 2{X,Y)(Y,X) + {X,Y){Y,Y){Y,X) > 0. 
Rewrite this as {X, Y){Y, X) < {X, X) + {X, Y){{Y, Y) - 1){Y, X) , and substi- 
tute XY for Y. If A is such that 

miY,Y)\\<l, (4.5) 

then |A|2(X,y)(|A|2(r,y) - 1){Y,X) is negative, so that 

\X\'{X,Y){Y,X)<{X,X). (4.6) 

If {Y, Y) = 0, then (4.5) holds for arbitrary A, so that (4.6) forces {X, Y){Y, X) 
to be zero. If (F, F) ^ 0, then taking \\\^ = \\{Y,Y)\\-'^ in (4.6) yields the 
inequality {X,Y){Y,X) < \\{Y,Y)\\{X, X). Either way, we have (4.3), which 
was to be demonstrated. □ 

Remark The special case that {X, X) = implies {X, F) = is due to 
Reinhard Werner, see e.g. [WerOl]. 

We will have occasion for the following corollary. 

Corollary 4.3 Let C : A ^ J\f and T : B ^ A be completely positive maps. 
Define ( • , • )c,t : B x B ^ Af by {X, Y)c,t = C {{X, Y)t). Then 

{X,Y)c,T{y,X)c,T < \\{Y,Y)c,t\\{X,X)c,t. 

Proof. This is a minor adaptation of the proof of lemma 4.2. The map C (8) id : 

^0 M2(C) — > J\f ® M2{C) is positive. If we apply it to each of the three factors 
in equation (4.4) separately, the result will still be positive. This has the effect 
of replacing every ( • , ■ )t that occurs in the entries by ( • , • )c,t- The rest of 
the proof then goes through uninterrupted. □ 



^We state the result for completely positive maps, but it is clear from the proof that 

4-positivity suffices. 
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4.3.2 Perfect Information Transfer 

If an information transfer is perfect, then its maximal added variance = 

II II will be zero. Indeed, the initial and final probability measures are 

identical, so that no variance is added. Using our newly acquired Cauchy- 
Schwarz inequality, we will show that the converse is also true. 

Theorem 4.4 Let T : B ^ A be a CP-map, let B G B be Hermitean. Then 
among the statements 

1 {B,B)=0. 

2 The restriction of T to B" , the von Neumann algebra generated by B, is 
a * -homomorphism B" T{B)" . 

3 {f{B), f{B)) = for all measurable functions f on the spectrum of B. 

4 T maps the relative commutant B' — {X G A] [X,B\ = 0} into T^B)' . 
the following relations hold: (1) (2) (3) => (4). 

Proof. First, we prove (1) =^ (2). By Cauchy-Schwarz (lemma 4.2), we have 
T(B") - T{B)T{B''-^) < ||(B,B)||(S"-i,B"-i) = 0. Induction on ri shows 
that T(B") = r(B)", so we have T{f{B)) = f{T{B)) for all polynomials / 
by linearity. Thus T is a *-homomorphism from the algebra of polynomials 
on the spectrum of B to that on T{B). Since T is weakly continuous, this 
statement extends to the algebras of measurable functions on the spectra of B 
and T{B), isomorphic to B" and T{B)" respectively. For (2) =^ (3), note that 
T{f(Bf ) = T{f{B)f. For (3) => (1), one simply takes f{x) = x. Finally we 
prove the implication (1) ^ (4). Suppose that [A, B\ = 0. Then [T{B),T{A)\ = 
T{[A,B]) - [T{A),T{B)\ = {A\B) - {B\A). {B is Hermitean.) By Cauchy- 
Schwarz, the last term equals zero if {B, B) does. □ 

So the maximal added variance is zero if and only if T is a perfect information 
transfer, i.e. a *-homomorphism B" A. 

We can now give an easy proof of theorem 2.13 in chapter 2. 

Theorem 4.5 Every CP-map T : B ^ A with a completely positive inverse is 
an isomorphism. 

Proof. Let S be the inverse of T. According to proposition 4.1, (A, A)5 > 0, so 
that also T{{A,A)s) > 0. Expanding this as T {S {A^ A) - S {A^ ) S (A)) > and 
using the fact that T is the inverse of S, we see that A'' A > T{S{A^)S{A)). 
On the other hand, TiS{A^)S{A)) - T {S {A'' ))T {S (A)) > implies < 
T{S{A^)S{A)), so that A^ A = TiSiA^)S{A)) for all A e A. For A = T(B), 
this reads T{B)^T{B) = T{B^B) for all S e 6, or {B, B)t = 0. The Cauchy- 
Schwarz inequality hen implies that also (B'\B)t = for all B',B e B, so 
that T{B'B) = T{B')T{B). □ 
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4.4 Joint Measurement 

In a jointly unbiased measurement, information on two observables A and A 
is transferred to two commuting pointers B and B. Lemma 4.2 will provide a 
sharp bound on its performance. 

4.4.1 Joint Measurement Inequality 

If A and A do not commute, then it is not possible for both information transfers 

to be perfect. (See [Ncu32], [WerOl].) The following inequality states that the 
degree of imperfection is determined by the amount of noncommutativity. 

Theorem 4.6 Let T : B ^ A be a CP-map, let B,B be commuting Hermitean 
observables in B, and define A := T{B), A := T{B), Sl := ||(.B,B)|| and 
E|:=||(B,B)||. Then 

SsS^>l||[AA]||. (4.7) 

Proof. Since [B, B] = 0, we have [A, A] = T{[B, B]) - [T{B),T{B)] = {B, B) - 
{B,B). By Cauchy-Schwarz, the latter is at most 2SsEg in norm. □ 

The following application will show that this bound is sharp. 

4.4.2 The Beamsplitter as a Joint Measurement 

A beamsplitter is a device which takes two beams of light as input. A certain 
fraction of each incident beam is refracted and the rest is reflected, in such a 
way that the refracted part of the first beam coincides with the reflected part 
of the second and vice versa. 




Fig. 4: Beamsplitter. 

We will show that the beamsplitter serves as an optimal joint unbiased mea- 
surement. 

The MELximal Added VariEince of the Beamsplitter 

In cavity QED, a single mode in the fleld is described by a Hilbert space H 
of a harmonic oscillator, with creation and annihilation operators and a 
satisfying [a^a^ = 1, as well as x = and p = The coherent states 

\a) = e^l"l /•^ 12^=0 ^^1'^) dense in and satisfy a\a) = a\a). 
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Quantummechanically, a beamsplitter is described by the unitary operator 

U = exp{9{a^ i^a — a(^a^)) onH^H. In terms of the coherent vectors, we have 
U\a)<S)\P) = |Q:cos(0)+/?sin(6'))«)|-Q;sin(6')+/3cos(0)). Note that t/^a^lL/ = 
cos(6')a (g) 1 + sin(6')l <g) a and that Wl<^ all = - sm{9)a 1 + cos(6')l (g) a. 
(This can be seen by sandwiching both sides between coherent vectors.) Since 
the map Y i— > WYU respects +, • and ^ , we readily calculate 

= cos(0)a; (g 1 + sin(0)l (g) X, 

U^x"^ iSilU = cos^(6')a;^ (g 1 + 2 sin(6') cos(6i)a: g) x + sin^(6')l g) x'^, 

U^liSipU = -sin(^)p0 1 + cos(^)l (gp, 

U^ligip^U = sin^(^)p^ (g 1 - 2cos(6i)sin(^)p0p + cos^(6i)l 

Let us identify normal states p on BCH) with normalized trace class operators 

R S T{T-L) by way of p{X) = tr{R,X). We are now interested in the map 
R ^ UR(g) \0){0\U\ from S{H) to S{H) (g S{H). In other words, we feed 
the beamsplitter only one beam of light in a state p, the other input being the 
vacuum. The dual of this is the CP-map T : Bin) g) BCH) B(n) defined by 
T{Y) := id (g ^{WYU), with $ the vacuum state = (0|X|0). 

Take for instance the pointer B = cos-'^{e)x 1. Then T{B) = a;(0|l|0) + 
tan(6')l(0|a;|0) = x. Starting with the pointer B = — sin~"'^(6')l g) p instead, 
we end up with T{B) = p. Apparently, splitting a beam of light in two parts, 
measuring a;0l in the first beam and l0p in the second, and then compensating 
for the loss of intensity provides a simultaneous unbiased measurement of x and 
p in the original beam. Since [x,p\ = i, we must'^ have SbS^ > |. 

We now calculate Sb and Eg explicitly. Prom (0|a;^|0) = 5, we see that 
r(B2) = x^ + itan2(6l)l. Thus E| = \\{B,B)\\ = itan2(6'). Similarly S| = 
|tan~^(0). We see that S^S^ = i, so that the beamsplitter is indeed an 
optimal jointly unbiased measurement. 

4.4.3 Sharpness and Compcirison 

The beamsplitter is an optimal joint measurement in the sense that it minimizes 
the product S^E^ of the maximal added variances. By scaling B, optimal joint 
measurements can be found for arbitrary values of Eg and E^, which shows 
the bound in theorem 4.6 to be sharp. 

The beamsplitter also performs well with other figures of merit. For ex- 
ample, if the quality of joint measurement is judged by the state-dependent 
cost R{T) := Var(_B,T*(p)) + Var(B,T*(/3)), then at least for Caussian p, the 
optimal measurement is the above beamsplitter with = 7r/4. (See [Hol82].) 

Choosing a figure of merit is -to a degree- a moral decision. Is it fair to 
prefer the angle 9 = tt/A over the others, or should should we deem all angles to 
be of equal value? The former point of view leads to the figure of merit R{T), 
the latter to E^E^. 

We will take the second point of view, and propose the product of the two 
maximal added variances as a figure of merit for joint unbiased measurement. 



^We neglect the technical complication of x and p being unbounded operators. 



79 



In chapter 5, we will show that homodyne detection of the spontaneous decay of 
a two-level atom constitutes a joint measurement with S^E^ = 1.056, slightly 
above the bound Sb^b — ^ provided by theorem 4.6. 

4.5 The Heisenberg Principle 

The Heisenberg Principle may be stated as follows: 

// all states are left intact, then no quantum information can be 
extracted from a system. 

This alludes to information transfer from an initial system ^ to a final system 
consisting of two parts: the system A and an ancilla B, containing the pointer B. 
We thus have an information transfer T : A<S) B ^ A from A to 1<S) B. 

An initial state p G S{A) gives rise to a final state T* p ^ S{A®B). Restrict- 
ing this final state to the system A (i.e. taking the partial trace over B) yields a 
'residual' state R*p G S{A), whereas taking the partial trace over A yields the 
final state Q* p E S(B) of the ancilla. We define the CP-maps i? : ^ ^ ^ by 
R{A) := T{A (g)l) andQ : B ->■ Ahy Q{B) := T(l ® B). The map R describes 
what happens to A if we forget about the ancilla B, and Q describes the ancilla, 
neglecting the original system A. 




Fig. 5: Constructing R* and Q* from T* . 

We wish to find a quantitative version of the Heisenberg principle, i.e. we want 
to relate the imperfection of the extracted quantum-information to the amount 
of state disturbance. 

4.5.1 A Figure of Merit 

We have already established a figure of merit for the quality of information 
transfer. For general information transfer, we use the measurement infidelity 
5 of definition 4.2, and for unbiased information transfer the maximal added 
variance Y? of definition 4.4. We must now find a figure of merit to quantify 
how well states on A are preserved by R* . 

For any initial state p € S{A), we want the final, residual state R* p to be 
as close to p as possible. We therefore define the maximal disturbance A to be 
the maximum trace distance between the initial and the final state. 



A := sup d{p, R* p) ■ 



(4.8) 
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The trace distance or Kolmogorov distance between two states p and a is 
defined as the worst case difference in probabihty that an event P occurs, 

d{p,a) := sup(p(P) - <7(P)) , 
p 

where P runs over the projections (or 'events') in A. It is shown in [NCOO] that 
one may equivalently have P run over all positive operators < P < 1, rather 
than just the projections. 

The name 'trace distance' stems from the fact that if p = tr(i? • ) and 
a = tr{S ■ ) for trace class operators R and S, then d{p, a) = |tr(|i? — S\). 

In order to arrive at a cleaner definition, replace R* (p) (P) in 

A = sup{|P*(p)(P) - p{P)\ ; p e S{A) , < P < 1} 

by p{R{P)) in order to obtain A = supo<p<i ||P(P) — P||- Summarizing, we 
arrive at the following quantification of state disturbance. 

Definition 4.6 The maximal disturbance A of a map R: A is given by 

A := sup{||P(P) - P|| ; PgA,0<P<1}. 

One has A = if and only if the operation R leaves all states of the original 
system perfectly in place, or equivalently iff P = id. 

4.5.2 HP for the Unbiased Case 

We first turn our attention to unbiased information transfer. The imperfection 
of the transfer is then captured in the maximal added variance E^. 

The Heisenberg principle only holds for quantum-information. Classical ob- 
servables are contained in the centre Z = {A S .4 ; [A, X] = VX G A}, 
whereas quantum observables axe not. The degree in which an observable A is 
'quantum' is given by its distance to the centre d{A, Z) = \nizez \\A — Z\\. In 
the following, we will take the algebra of observables to be ^ = B{T-L) for some 
Hilbert space %. The centre is then simply CI. 

Theorem 4.7 Let T : B{n) ®B ^ B{U) he a CP-map, let B & B he Her- 

mitean. Define A := T(l (g) B), and := \\ {1(g) B, 1(g) B)\\. Furth ermore, 
define A := supp{||P(P) - P\\}, with R the restriction ofT to B{Wj (g) 1. Then 

i - A 

S > diA, Z) , ^ . (4.9) 

VA(1 - A) 

This hound is sharp in the sense that for all A e [0, |], there exist T and A for 
which (4-9) attains equality. 

We prepare the ground for theorem 4.7 by means of the following lemma. 

Lemma 4.8 Let P he a projection, and let < X < 1 be such that \\P — X\\ < 
A. Then \\X{1 - X)\\ < A(l - A). 
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Proof. Let x be in Spec(X), the spectrum oi X. As < X < 1, we have 
< a; < 1. Without loss of generality, assume that there exists a normalized 
eigenvector -0 such that Xil) = xtp- (If this is not the case, one may complete 
the proof using approximate eigenvectors.) Decompose ip over the eigenspaces 
of P, i.e. write ip = Xi ~^ Xo, with xi ™ the image and xo in the kernel of P. 
Then {X — P)-)p — {x - l)xi + xxo- Since ||xi|P + llxolP = 1, the inequality 
A2 > \\{X-P)^P\\^ = (a;-l)2||xi||2+x2||xof implies that either (l-x)^ < 
or x"^ < A^. We conclude that Spec(X) C [0, A] U [(1 - A),l]. This implies 
that Spec(X - X^) C [0, A(l - A)], as desired. □ 

We proceed with the proof of theorem 4.7. 

Proof. We may assume A < ^, as inequality (4.9) is trivially satisfied otherwise. 
Let X := sup(Spec(yl)) and y := inf (Spec(A)), so that d{A,Z) = {x — y)/2. 
Without loss of generality, assume that there exist normalized eigenvectors 
and tpy satisfying Atp^ = xtp^ and A^py = yipy. (If this is not the case, one uses 
approximate eigenvectors.) Define V- := (V'a; + V'j/)/v^) and B := 

On the one hand, we have \\R{B) - B\\ < A, so that \\[A, R{B) - B]\\ < 
2Ad{A,Z). On the other hand, we have = d{A,Z). We then use the 

triangle inequality to see that 

\\[T{1®B),T{B®1)]\\ = \\[A,R{B)]\\ 

= \\[A,B] + [A,R{B)-B\\\ 
> d{A,Z){l-2A). 

This brings us in a position to apply theorem 4.6 to the commuting pointers 
S 1 and 1 (g) B, yielding 

2J:^J\\{B(»1,B(»1)T\\ > d{A, Z)(l - 2A) . (4.10) 

In order to estimate \\{B ® l^B ® 1)t|1 = IK-B, -B)/?]], we use the fact that 
B'^ = B to write {B,B)r = R{B) - R{Bf. Lemma 4.8 with X = R{B) and 
P = B then yields \\R{B) - R{Bf\\ < A(l - A). Inserting 

V^A^T^ > •yiloi^V^^iH 

into inequality (4.10) yields 2E^A(1 - A) > d{A, Z){1 - 2A), which was to be 
demonstrated. For sharpness of the bound, see section 4.7.5. □ 

In the case of no disturbance, A = 0, we see that S — >■ oo. No information 
transfer from A is allowed if all states on A are left intact. This is Werner's 
(sec [WcrOl]) fornmlation of the Heisenberg principle. 

In the opposite case of perfect information transfer, S = 0, inequality 4.9 
shows that A must equal at least one half. We shall see in section 4.7 that this 
corresponds with a so-called 'collapse of the wave function'. 
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These two extreme situations are connected by theorem 4.7 in a continuous 
fashion, as indicated in the graph below. 

6.0 -H 
5.0 

4.0 -> 




Fig. 6: The combinations (A, S) below the curve are for- 
bidden, those above are allowed. (With d(^, Z) = 1.) 

The upper left corner of the curve illustrates the Heiseuberg principle, whereas 
in the the lower right corner, we can see the collapse of the wave function. 

4.5.3 HP for General Information Transfer 

We now prove a version of the Heisenberg Principle for general information 
transfer. We formulate it as a trade-off between the maximal disturbance A 
and the measurement infidelity 5. 

Corollary 4.9 Let T : B{H) B B{n) he a CP-map, let A e B{n) and 

B e B be Hermitean, A Z = CI. Define A := supp{|li?(P) - P\\}, with R 
the restriction ofT to B{n) ® 1. Define 5 := sups{|lT(l ® ls{B)) - ls{A)\\}. 
Then, for 6 and A in [0, |], we have 

(i-5r + (i-Af <i. (4.11) 

This bound is sharp in the sense that for all A S [0, |], there exists a T for 
which (4-11) attains equality. 

Proof. Choose a nontrivial subset S of Spec(^) and put P := 1 (g) ls{B). 
Since ||T(P) - l.s(.4)|| < 5, lemma 4.8 yields = \\T{P) - T{P)'^\\ < S{1 - 5). 
From the proof of the lemma, we see that Spec(T(P)) is contained in [0, 5] U 
[1 — 5,1]. Since Spec(T(P)) contains points in both [0,6] and [1 — 6,1], we 
have d{T{P),Z) > \ — 6. Then apply theorem 4.7 to the pointer P to obtain 
VW^) > (i-<5)(i-A)/VA(l - A), orequivalently(i-<5)2 + (l-A)2< i. 
For sharpness, see section 4.7.5. □ 
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A measurement which does not disturb any state (A = 0) cannot yield infor- 
mation {S > |). This is the Heisenberg principle. On the other hand, perfect 
information {6 — 0) implies full disturbance (A > i), corresponding to the 
collapse of the wave function. Both extremes are connected in a continuous 
fashion, as depicted below. 



0.5 -1 



5 




Fig. 7: The combinations (A, S) below the curve are for- 
bidden, those above are allowed. 

4.5.4 Application: Resonance Fluorescence 

Corollary 4.9 may be used to determine the minimum amount of disturbance if 
the quality of the measurement is known. Alternatively, if the system is only 
mildly disturbed, one may find a bound on the attainable measurement quality. 
Let us concentrate on the latter option. 

We investigate the radiation emission of a laser-driven two-level atom. The 
emitted EM-radiation yields information on the atom. A two-level atom (i.e. a 
qubit) only has three independent observables: a^, Cy and cr^. There are various 
ways to probe the EM field: photon counting, homodyne detection, heterodyne 
detection, et cetera. For a strong (fi ^ 1) resonant (wiasor = '^atom) laser, we 
will use corollary 4.9 to prove that any EM-measurement of ax, Oy or will 
have a measurement infidelity of at least 

with A the coupling constant. For a measurement with two outcomes, 5 is the 
maximal probability of getting the wrong outcome. 

Unitary Evolution on the Closed System 

The atom is niodcUed by the Hilbert-space (only two energy-levels arc 
deemed relevant). In the field, we discern a forward and a side channel, each 
described by a bosonic Fock-space J-. The laser is put on the forward channel, 
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which is thus initially in the state $n, the coherent state with frequency w and 
strength fl. (The field strength is parametrized by the frequency of the induced 
Rabi-oscillations) . The side channel starts in the vacuum state $o- 

If we permit ourselves the luxury of identifying a state with its trace class 
operator, then the state at time t is given by 

T;(p) = i7(i)(p0$n0$o)C/^(i). 

with time evolution 

jUt = -i{Hs + HF + XHi)Ut. 

Hs G -B(C^) is the Hamiltonian of the two-level atom, Hp £ B{J^ ® that of 
the field and XHi e -B(C^) ® B{T ® T) is the interaction-Hamiltonian. Define 
the interaction-picture time evolution by 

flip) := Ur{t)^U2{t)^T;{p)U2{t)Ur{t) , 

where U\{t) := e~'^^* and U2{t) := e~'^^* are the 1-parameter groups of 
unitarics for the 'unperturbed' time evolution. 

We now investigate Tj instead of T) . We are looking for a bound on the 
measurement infidelity 5 = supg{||T(l $$ ls{B)) — ls{A)\\} of T, but if we move 
to the interaction picture and define B BU2: then f(ls(i3)) = T{ls{B)), 

so that 5 = sup5{||f (1 ® Is(-B)) - ls(^)||} = 5. If we find the interaction- 
picture disturbance A, then corollary 4.9 will yield a bound on 5, and therefore 
also on 5. 

In the weak coupling limit A 4- 0, the interaction picture time evolution Tt 
is given by f^ip) = U(t/X'^){p $n ^o)UHt/\'^), where the evolution of the 
unitary cocycle t n- J7t is described (sec [AFL90]) by a Quantum Stochastic 
Differential Equation or QSDE. Explicitly calculating the maximal added vari- 
ances by solving the QSDE is in general rather nontrivial, if indeed possible 
at all. 

In chapter 5, we will do this for the case of spontaneous decay, i.e. 17 = 0, 
with the map Ti restricted to the commutative algebra of homodyne measure- 
ment results. But here we take an other approach. 

Master Equation for the Open System 

Fortunately, in contrast to the somewhat complicated time evolution Tt of the 
combined system, the evolution restricted to the two-level system is both well- 
known and uncomplicated. If we use as a unit of time, then the restricted 
evolution [p) := trjr^jrTj* (p) of the two-level system is known (see [BGM04]) 
to satisfy the Master equation 

jkl{p) = L{Rl{p)), (4.12) 

with Liouvillian L{p) := lin[e-'^'^-^'>'^V + e^'^-^^^V^ , p] - ^{V^V. p} + V pV^i . 
In this expression, E is the energy-spacing of the two- level atom and = 
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V = a- are its raising and lowering operators. In the case u> = E oi resonance 
fluorescence, we obtain 

L{p) = lin[v + v^,p] - l{v^v,p} + vpv^ . 

If we parametrize a state by its Bloch- vector Rf{p) = ^(1 + xax + ycy + zaz), 
then equation 4.12 is simply the following differential equation on R^: 




This can be solved explicitly. For f2 1, the solution approaches 

° ^ (^^\ 

e-t*cos(Ot) e-3*sin(m) ■ 
-e-t*sin(m) e-?*cos(m)/ V^o/ 

If we move to the interaction picture once more to counteract the Rabi oscilla- 
tions, i.e. with U\{t) = es^*'^"' and U2 = 1, we see that the time evolution is 
transformed to 

x\ ( e~2* \ /xoX 
y = e-i* 2/0 • 

Since the trace distance D{p, r) is exactly half the Euclidean distance between 
the Bloch vectors of p and r, (see [NCOO]), we see that A = ^(1 — e~3*). For 

any measurement of (Jx, cry or cr^, wc therefore have ^ > | — — e~i^ by 
corollary 4.9 (remember that t is in units of A~^). 



0.5 -1 




Fig. 8: Lower bound on d in terms of t (in units of A ^). 
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4.6 Classical Coding 

In classical coding, a single quantum state is encoded into classical information. 
Decoding this classical information in order to regain the original quantum 
state is known to be impossible. However, one can attempt to construct a state 
which comes as close as possible. We give bounds on the smallest possible trace 
distance between the original and the decoded state which can be reached. 

We start with an approach based on the operator-valued Cauchy-Schwarz 
inequality. It will yield a bound, but in contrast to the case of joint measurement 
and the Heisenberg principle, it will not be sharp. We will then prove a sharp 
bound, starting from Keyl and Werner's no-cloning theorem [KW99]. 

4.6.1 ClEissical Coding 

We are interested in operations that take quantum states into classical prob- 
ability measures and vice versa. As we have seen in chapter 2, a classical 
probability space (0,1], P) corresponds to a commutative quantum probability 
space (.4, p). The algebra is given by the random variables 'up to measure zero', 
A := E.P), and the state is simply the expectation p{f) = J^^ f{uj)¥{doj). 

An operation that maps quantum states in S{B(T-L)) to probability measures 
in S{A) must therefore be the dual of a CP-map C : A ^ B{'H), with A := 
S,P). According to section 4.2.1, such a map is precisely a POVM 
E — )• B{H). Similarly, an operation that maps classical probability distributions 
into quantum states must be the dual of a CP-map D : B{H) — )• A. 

Classical Coding 

By classical coding, we mean the following procedure. First, classical informa- 
tion is extracted from a quantum system. This is described by the coding map 
C : A ^ B{H), with A = L^{il, E). Then, on the basis of this classical infor- 
mation, the original state is reconstructed as well as possible by means of some 
decoding procedure. This is described by a CP-map D : B{'H) A. In the 
dual picture, this gives 

S{B{H)) ^ S{A) ^ S{B{H)) . 

The coding procedure is flawless iff every state is reconstructed perfectly, i.e. iff 
CoD: Bin) -> Bin) is the identity 

It is well known [WerOl, Maa04] that perfect classical coding is impossible. 
So let us investigate how close we can come to perfection. 

A Figure of Merit 

To define a figure of merit for classical coding, we proceed in close analogy to 
the maximal disturbance A. 

For any initial state p G >S(S('H)), we want the final, reconstructed state 
D*C* p to be as close to p as possible. We therefore define the coding im,per- 
fection A to be the maximum trace distance between the initial state and its 
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reconstruction, 

A:= sup d{p,D*C*p). 
peSiBin)) 

Comparing this with equation (4.8), we see that this is precisely the maximal 

disturbance for the map R* = D*C*. We can follow the reasoning leading up 
to definition 4.6 word by word to arrive at the following reformulation. 

Definition 4.7 A classical coding procedure consists of a commutative von 
Neumann algebra A, and a pair of CP-maps C : A ^ B{T-L) and D : B{T-L) — )■ A. 
Its coding imperfection is defined as 

A= sup \\B-CD{B)\\. 

0<B<1 

The coding inipcrfection qTiantifics the quality of the coding procedure: the 
better the procedure, the smaller the imperfection. One would only have A = 
if all states could be encoded perfectly. We now investigate how close to zero 
A can come. 

4.6.2 Classical Coding Inequality 

We use the Cauchy-Schwarz inequality to obtain a the bound on A for any clas- 
sical coding procedure. (The proof is inspired by the 'impossibility of classical 
coding' version in [Maa04].) 

Theorem 4.10 Any classical coding procedure has coding imperfection 

A > (3- \/5)/4. 

Proof. Take two orthogonal vectors ^H. and define X to be the projection 
on ■0, and Y the projection on (^ + (t))/\/2. We have y]|| = 1/2. Since A 
is Abelian, we have D{X)D{Y) = D{Y)D{X), and we can write 

[X,Y] = [X,Y]-CD{[X,Y])+ (4.13) 
C {D{XY) - D{X)D{Y)) - (4.14) 
C{D{YX)- D{Y)D{X)) . 

We will bound the r.h.s. in terms of A. Remembering that the l.h.s. is at least 

1/2 in norm will then yield a minimum value on A. 

We start with (4.13). Like any antihermitean operator, [X, Y\ can be written 
as [X,F] = i{A+ - A_), with < A± < = \l. Therefore, we have 

\\[X,Y]^CD{[X,Y])\\ < A||A+|| +AP_|| < A. 

We then proceed with (4.14). Consider the positive semidefinite sesquilinear 
form (X, Y)c,D ■= C {D{X^Y) - D{X)^D{Y)), in terms of which the expres- 
sion (4.14) equals {X,Y)c,d — {y,X)c,D- According to corollary 4.3, we then 
have 

\\iX,Y)c,D - iY,X)c,n\\ < 2^\\{X,X)mY,Y)\\. 
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Now {X,X)c,D = {X,X)cD - {D{X),D{Xj)c < {X,X)cd, which in turn 
can be written {X,X)cd = CD{X){I - CD{X)) because X^X = X. 

As X is a projection, and \\X — CD{X)\\ < A, we can apply lemma 4.8 
to obtain \\CD{X) - CD{Xf)\\ < A(l - A), or equivalently \\{X,X)cd\\ < 
A(l - A). Of course, we similarly have ||(y,y)cD|| < A(l - A). All in all, we 
can bound (4.14) as \\{X,Y)c,d - {Y,X)c,d\\ < 2A(1 - A). 

We conclude that 1/2 = < A + 2A(1 - A), or (A - 3/4)2 < r^/^Q 

Thus A > (3 - 75) /4, which was to be proven. □ 

The bound A > (3 — ■\/5)/4 is unfortunately not the best possible one. However, 
the good news is that the best bound follows easily from the no-cloning theorem 
of Keyl and Werner [KW99]. 



4.6.3 The No-Cloning method 

Each classical coding procedure yields a 'cloning machine'. All one has to do 
is to decode the classical information an arbitrary amount M of times, rather 
than just once. A bound for cloning will therefore induce a bound for classical 

coding. 

Theorem 4.11 Any classical coding procedure on a Hilbert space T-L of dimen- 
sion d has coding imperfection 

A>^. 
- d+1 

Proof We set H = C<^, and A = i°°(il) with Q = {1, 2, . . . , n}. The 'diagonal' 

map n n'^ : i i— >■ (z, . . . , i) induces the 'classical cloning' map K : A'^^ — > 
A, i.e. {Kf)(i) := f(i, . . . ,i). (Note that for e.g. = M, a cloning map poses 
difficulties.) 

The composition T := C o K o 1?®*^, mapping B(C'^)®^ to BiC^), is a 
so-called 1 M doner. By construction of the operation T, it is clear that 
T(/ O . . . O B . . . O 7) = CD{B). The main result of [KW99] then says that 
suPo<B<7 \\T{I ®...®B®...®I)-B\\ > Since M e N+ was 

arbitrary, this implies A > n 

The mmibcr (3 — \/5)/4 ~ 0.19, obtained from the Cauchy-Schwarz inequality, 
is inferior to the | fa 0.33 from the no-cloning theorem in the best possible case 
d = 2, in usefulness as well as in the standard order on M.. Theorem 4.11 is thus 
the better result. 



4.7 Collapse of the Wave function 

Recall that the Heisenberg principle states that if all states are left intact, then 
no information can be extracted from a system. The 'collapse of the wave 
function' may be seen as the flip side of the Heisenberg principle. It states that 
if information is extracted from a system, then its states undergo a very specific 
kind of perturbation, called decoherence. 
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4.7.1 Collapse for Unbiased Information Transfer 

We start out by investigating unbiased information transfer. We prove a sharp 
upper bound on the amount of remaining coherence in terms of the measurement 

quality. 

Theorem 4.12 Let T : Ai^ B ^ A be a CP-map. Let B e B be Hennitean, 
and consider T as an unbiased information transfer from A = T(10B) tol^B 

with maximal added variance T,'^ = j] (l(K'i3, IfXiB)]! . Suppose that ipx and ipy are 
unit eigenvectors of A with different eigenvalues x and y respectively. Define, 
for |Q:p + |^|^ = l, the coherent and the incoherent state 

Pcoh := laip^o + I3tpy){atpx + Ptpy\, pinc := \a\'^\tpx){tl'x\ + \0\'^\'4'y){i'y\ ■ 

If R: A^ A is the restriction ofT to A^l, then we have the following bound 
in trace distance: 

d(i?*(peoh),i?*(pinc)) < , = . (4.15) 

^l + A{i:/\x-y\f 

This bound is sharp in the sense that for all values ofT,/\x — y\, there exists a 
CP-map T for which (4-15) attains equality. 

0.5 
0.4 

Coherence 
0.3 

0.2 

0.1 



0.0 1.0 2.0 3.0 4.0 5.0 6.0 
Z/\x-y\ 

Fig. 9: Bound on the coherence as a function of E/|x — y|. 
Only points below the curve are allowed. 

Consider the ideal case of perfect (E = 0) information transfer. Suppose that 

the system A is initially in the coherent state pcoh = + Pipy){c^ipx + /^V-'yl- 
Then theorem 4.12 says that, after the information transfer to the ancilla B, the 
system A cannot be distinguished from one that started out in the incoherent 
state Pine = |q^P|V'2;)(V-'2;| + l/^P I "01/) (''/'ji I instead. As far as the behaviour of A 
after the information transfer is concerned, it is therefore completely harmless 
to assume that a collapse 

\ai>x+^^y){a^oc+l3ijy\ ^ \af\iJx){^x\ + \l3f\i'y){^y\ 

will have occurred at the start of the procedure. 
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Proof. The l.h.s. of equation (4.15) can be written 

sup{ I i?(P)Vy) + c.c. ; P e >l, = = P } . 

Furthermore, since 2|a| < 1, it suffices to bound the 'coherence' {-ipx \ R{P)ipy) 
on all projections P. Now (x — y){ipx \ R{P)'4'y) = (V'k I [^j -R(P)]'0y)) and fur- 
thermore [A, R{P)] = (P (E)1,1(E) B) - {1(E) B,P (g)l). Therefore 

{x-y){'4,x\R{P)^y) = (V^,|(P®l,l®B)V'y)-(V'c.|(l<^5,P<^l)V'y>, (4.16) 

and we will bound these last two terms. 

By Stincspring's theorem [StiSS, Tak79], we may assume without loss of 
generality that T is of the form TiX) = XV for some contraction V. We 
then obtain iX,Y) = X^l - VV^)YV. Defining g{X) := Vl - VV^XV, 
we have iX,Y) = g{X)^g{Y), and therefore [[^(l P)|| = S. This leads to the 
estimate 

{ij^l {PE) 1,1 ®B)i;y) = {g{PE)l)iJx\g{l<EB)^Py) 



< 



||g(P® 1)^,11115(1 ®i?)V'yl| 



< Sv'(V'. I(P<^1,P® 

We will bound {^^ \ {T{P^ ®1)- T{P®lf)i^x) = (V'. | {R{P) - i?(P)')V'.) in 
terms of the coherence. For brevity, let us denote {tjjx \Xtjjx') by X^x'- Since 
V'c. -L i)y, we have {R{Pf)xx > \RiP)xx\^ + \R{P)xy\^ so that 

(i?(P) - RiPf)xx < R{P)xx{l - R(P)xx) - \R{P)ccy\^ ■ 

Since x{l — a;) < j for all x € M, this is at most j — |P(P) 
All in all, we have obtained 



|2 



xyl 



(P 1, 1 ® B)xy < - \R{P)xy\^ , 

and of course the same for x o y. Plugging this into equation 4.16 yields 



la; - y\\R{P)xy\ < 2E^i - |P(P),,| 



or equivalently the desired result \R(P)xv\ < ^/N y\ Pqj. sharpness, see 
section 4.7.5. ^ □ 

We have seen that if E = 0, then theorem 4.12 is precisely the collapse of the 
wave function. Now consider the other extreme of a measurement which leaves 
all states intact, i.e. R* p = p for all p. Then there exist states for which 
the l.h.s. of equation (4.15) equals |, forcing S — > oo; no information can be 
obtained. This is Werner's formulation of the Heisenberg principle. 

Theorem 4.12 thus unifies the Heisenberg principle and the collapse of the 
wave function. For S = we have a full decoherence, whereas if all states are 
left intcict, we have S — >^ oo. For all intermediate cases, the bound 4.15 on the 
remaining coherence is an increasing function of — t/|. 

This agrees with physical intuition: decoherence between ipx and ipy is ex- 
pected to occur in case the information transfer is able to distinguish between 
the two. This is the case if the added variance is small w.r.t. the differences in 
mean. 
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4.7.2 Application: Perfect Qubit Meaisurement 

In section 4.2, we have encountered the von Neumann Qubit measurement. 

Now consider any perfect measurement T of with pointer 1 ® {5-^- — 5^) 
which leaves | t)(t I and | | in place, that is i?*(| t)(t I) = I t)(t I and 
= l4-)(4-|- (Such a measurement is often called nondestructive.) 



Theorem 4.12 then reads R*{\a t +^ i){a t +/3 4 |) = |Q!|2|t)(t| + 



Incidentally, the trace distance between the centre of the Bloch sphere and its 
surface is so that we read off A = sup{£)(i?*(p), p); p e ^(Ms)} = \. This 
was predicted by theorem 4.7. 

4.7.3 Collapse for Nondestructive Meatsurement 

We will prove a sharp bound on the remaining coherence in general information 
transfer. The word 'general' in the previous sentence means that the mea- 
surement need not be unbiased, but we will focus attention on nondestructive 
measurements. 

A measurement of A is called 'nondestructive' (or 'conserving' or 'quantum 
nondemolition') if it leaves the eigenstates of A intact, so that repetition of 
the measurement will yield the same result. For example, the measurement in 
section 4.2.1 is nondestructive, the one in section 4.5.4 is destructive. 

Restricting attention to the nondestructive case is certainly not uncommon 
in quantum measurement theory (see [Per93]). In all fairness though, we make 
the iioiideinolitioii assumption out of technical convenience, not because we 
believe it to be of fundamental importance. 



as illustrated below. 



z 




Fig. 10: Collapse on the Bloch-sphere for perfect measurement. 
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Corollary 4.13 Let T : BCH) (g) B ^ BCH) be a CP-map, and let A e B{U) 

and B € B be Hermitean. Suppose that A has an orthonormal basis of eigenvec- 
tors ipi with eigenvalue a,. If ai ^ aj, and |ap + |/3p = 1, define the coherent 
and the incoherent state 

Pcoh := loiipi + P'ipj){ail)i + I3ipj\, Pine := |ap|'0i)(V'i| + \P\'^\i>j){'>Pj\ ■ 

Suppose that T is nondestructive, i.e. R* (Itpi) {tpil) = \i^i){tpi\ for all tpi, with 
R the restriction of T to B(T-L) ® 1. Then if the measurement infidelity 6 := 
sups{||T(l ® Is(-B)) - satisfies S € [0, i], then 

d[R* (Pcoh) , Pine) < - S) . (4.17) 

This bound is sharp in the sense that for all S € [0, |], there exists a CP-map 
T for which (4-17) attains equality. 
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Fig. 11: Bound on the coherence in terms of 5. 
Only points below the curve are allowed. 

Proof. For sharpness, see section 4.7.5. Choose a set S such that ai G S and 
Uj ^ S. T is an unbiased measurement of T(l (g) ls{B)) with pointer ls{B) 
and maximal added variance < S{1 — S) (cf. the proof of corollary 4.9). We 
will prove that ipi and ipj are eigenvectors of T(l (g ls{B)) with eigenvalues x 
and y which differ by at least 1 — 26. 

Define Pi := Since T is nondestructive, we have {ipj \ R{Pi)ipj) = 

{ipj I Pi ipj) for all j. Apparently, R{Pi) has only one nonzero diagonal element, 
a 1 at position (z,i). Since R{Pi) > 0, this implies R{Pi) — Pi. 

From {Pi ®l,Pi®l) = 0, we obtain {Pi (g 1, 1 (g ls(_B)) = by Cauchy- 
Schwarz. Since P^ = T{P^®1), we have [T{mis{B)), P^\ = (Pi(gl, l(gls(B))- 
(1 (g) \s{B),Pi (g) 1) = 0. Therefore i>i is an eigenvector of T(l Is(-B)), with 
eigenvalue x, say. By a similar reasoning, is also an eigenvector, denote its 
eigenvalue by y. 

Since \\T{\®\s{B)) - ls{A)\\ < 6, we have in particular ||(T(1 ® ls(B)) - 
ls{A))iP,\\ ^\x-l\ < 6 and ||(T(1 ® ls{B)) - ls(A))^j|| = \y\ < S, so that 
\x — y\ > 1 — 26. We can now apply Theorem 4.12. 
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On the l.h.s. of the bound (4.15), we may substitute R*{pinc) = Pine on 
account of T being nondestructive. On the r.h.s., we substitute S = \/d{l — 6) 
and \x — y\ = {1 — 26). Strikingly enough, this yields the bound 

(v/5(T^/(l - 2S))/^l + 4(5(1 - 6)/il - 2Sr) = VSO^) □ 

For perfect measurement {6 = 0), this yields R*{\a^l)i + (iipj ){aipi + /3tpj\) = 
\a\'^\tpi){tpi\ + \/3\'^\ipj){il>j\; all coherence between ipi and ipj must vanish. This 
collapse of the wave function is illustrated in the lower left corner of fig. 11. 
On the other hand, if all states are left intact so that R* = id, then we must 
have 5 = 5; no information can be gained. This is illustrated in the upper 
right corner of fig. 11. Corollary 4.13 is a unified description of the Heisenberg 
principle and the collapse of the wave function. 



4.7.4 Application: Nondestructive Qubit-Measurement 

In quantum information theory, a Cz-measurement is often taken to yield output 
+1 or —1, according to whether the input was |t) or It is nondestructive 
if it leaves the states |t) and \l) intact, yet it is only unbiased if it is perfect. 
Corollary 4.13 shows that in the nondestructive case, the Bloch-sphere collapses 
to the cigar-shaped region depicted below: 




Fig. 12: Collapse on the Bloch-sphere with 6 = 0.01. 

Current single-qubit readout technology is just now moving into the regime 
where the bound (4.17) becomes significant, in [LD+06], a nondestructive 
measurement of a SQUID-qubit was described, with experimentally determined 
measurement infidelity 6 = 0.13. The bound then reads rf(i?*Pcoh) Anc) ^ 0.336. 
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4.7.5 Sharpness of the Bounds 

We have yet to prove sharpness of the bounds. As it turns out, the following 
CP-map will do the trick for all the bounds in a single stroke. 
Let n = {+1,-1}, and B = L°°(f7). Let 

+ ■ I ^J' [o ' 

and define T : M2{C) ® B ^ A'hiV) by 

T{X ® /) := f{+l)V+XV+ + f{-l)V-XV- . 

For p = 0; this is the von Neumann- measurement. As a measurement of CTz 
with pointer B := — <5_)/(l — 2p), we have 5 = p. This yields bounds 
on the disturbance and on the coherence. Corollary 4.9 and theorem 4.7 yield 
A > i — a/p(1 — p), corollary 4.13 and theorem 4.12 yield (pcoh) , Pine) < 
\/p{X--p) We now explicitly calculate the restriction of T to M2, and find 

^ ^ ( Pii 2yp(l-p)pi2\ 

V2Vp(i-p)p2i P22 ; ■ 

The maximal remaining coherence occurs for a = (5 = 1/^/2, for which it equals 
■\/p(l — p). The maximal disturbance equals A = ^ — \/p{l — p). This shows 
the bounds (4.9), (4.11), (4.15) and (4.17) to be sharp. 

4.8 Conclusion 

Our expedition into the heart of darkness has brought us the following spoils. 

I Theorem 4.6 provides a sharp, state independent bound on the perfor- 
mance of unbiased joint measurement of noncommuting observables. In 

the case of perfect (S = 0) measurement of one observable, it implies that 
no information whatsoever (S' = 00) can be gained on the other. 

II Theorem 4.7 (for unbiased information transfer) and corollary 4.9 (for gen- 
eral information transfer) provide sharp, state independent bounds on the 
performance of a measurement in terms of the niaxinial disturbance that 
it causes. In the case of zero disturbance, when all states are left intact, 
it follows that no information can be obtained. This is the Heisenberg 
principle. 

Ill Theorem 4.11, based on a result of Keyl and Werner, provides a sharp, 
state independent bound on the quality of classical coding. In contrast 
to I, II and IV, the result based on the Cauchy-Schwarz inequality is not 
optimal in this case. 
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IV Theorem 4.12 (for unbiased information transfer) and corollary 4.13 (for 
nondestructive information transfer) provide a sharp upper bound on the 
amount of coherence which can survive information transfer. For per- 
fect information transfer, all coherence vanishes. This clearly proves that 
decoherence on a system is a mathematical consequence of information 
transfer out of this system. If, on the other hand, all states are left in- 
tact, then it follows that no information can be obtained. This is the 
Hciscnbcrg principle. Theorem 4.12 and corollary 4.13 connect these two 
extremes in a continuous fashion; they form a unified description of the 
Heisenberg principle and the collapse of the wave function. 

All these restrictions follow directly from the formalism of quantum probability 
theory as set forth in chapter 2. 
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Optimal Pointers for Joint 
Measurement 

Having concluded our investigation into the theoretical restrictions of quantum 
measurement in general, we turn our attention towards a particular example of 
a joint measurement procedure. In this chapter, we study a qubit in interaction 
with the electromagnetic field. By means of homodyne detection, the field- 
quadrature At + aI is observed continuously in time. Due to the interaction, 
information about the initial state of the qubit is transferred into the field, thus 
infiuencing the homodyne measurement results. We construct random variables 
(pointers) on the probability space of homodyne measurement outcomes having 
distributions close to the initial distributions of and ct^. Using variational 
calculus, we find the pointers that are optimal. These optimal pointers are 
very close to hitting the bound imposed by the joint measurement inequality, 
theorem 4.6 in chapter 4. 



5.1 Introduction 

The implementation of quantum filtering and control [Blv88] in recent experi- 
ments [AS+02], [GSM04] has brought new interest to the field of continuous time 
measurement of quantum systems [Dav69], [Dav76], [Blv88], [Blv92b], [Car93], 
[WM93], [BGM04]. In particular, homodyne detection has played a consider- 
able role in this development [Car93]. In this chapter, we aim to gain insight 
into the transfer of information about the initial state of a qubit from this qubit, 
a two-level atom, to the homodyne photocurrent, which is observed in actual 
experiments. Our goal is to perform a joint measurement of two noncommuting 
observables in the initial system. In order to achieve this, we construct random 
variables (pointers) on the space of possible homodyne measurement results, 
having distributions close (in a sense to be defined) to the distributions of these 
observables in the initial state. 
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The problem of joint measurement of noncommuting observables has been 
studied by several authors before, see [Neu32], [Dav76], [Hol82] and the refer- 
ences therein. As a measure for the quahty of an unbiased measurement, we 
use the maximal added variance, i.e. the difference between the variance of the 
pointer in the final state and the variance of the observable in the initial state, 
evaluated in the worst case initial state. (See page 73.) Recall that according 
to theorem 4.6 in chapter 4, these maximal added variances for two pointers, 
corresponding to two noncommuting observables of the initial system, satisfy 
the Heisenberg-like relation (4.7) that bounds how well their joint measurement 
can be performed. 

The present chapter concentrates on the example of a qubit coupled to the 
quantized electromagnetic field. We study this system in the weak coupling 
limit [Gou05], i.e. the interaction between qubit and field is governed by a quan- 
tum stochastic differential cqiiation in the sense of Hudson and Parthasaratliy 
[HP84]. In the electromagnetic field we perform a homodyne detection experi- 
ment. Its integrated photocurrent is the measurement result for measurement 
of the field-quadrature At + Al continuously in time. Using the characteristic 
functions introduced by Barchielli and Lupieri [BL85], we find the probability 
density for these measurement results. In this density the x- and z-component of 
the Bloch vector of the initial state appear, indicating that homodyne detection 
is in fact a joint measurement of ax and cr^ in the initial state. 

Our goal is to construct random variables (pointers) on the probability space 
of homodyne measurement results having distributions as close as possible to 
those of the observables <Tx and in the initial state of the qubit. 'As close 
as possible' is taken to mean that the pointer must give an unbiased estimate 
of the observable, with its maximal added variance as low as possible. Using 
an argument due to Wiseman [Wis96] , we first show that optimal random vari- 
ables will only depend on the endpoint of a weighted path of the integrated 
photocurrent. Allowed to restrict our attention to this smaller class of pointers, 
we are able to use standard variational calculus to obtain the optimal random 
variables. They do not achieve the bound imposed by the joint measurement 
theorem 4.6, but will be off by less than 5.6%. 

The remainder of the chapter is organized as follows. In section 5.2 we 
introduce the model of the qubit coupled to the field in the weak coupling limit. 
In section 5.3, we briefly recall the definition of maximal added variance, as well 
as the Heisenberg-like relation (4.7) for joint measurement. In section 5.4 we 
calculate the characteristic function of Barchielli and Lupieri for the homodyne 
detection experiment. Section 5.5 deals with the variational calculus to find 
the optimal pointers. In section 5.6 we calculate the densities of the optimal 
pointers and then capture our main results graphically. In the last section we 
discuss our results. 

5.2 The Model 

We consider a two-level atom, i.e. a qubit, in interaction with the quantized 
electromagnetic field. The qubit is described by and the electromagnetic 
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field by the symmetric Fock space T over the Hilbert space of quadratically 
integrable functions L^(]R) (space of one-photon wave functions), i.e. 

oo 

J':=ce0i^(]R)®»'=. 

With the Fock space T we can describe superpositions of field-states with dif- 
ferent numbers of photons. The joint system of qubit and field together is 
described by the Hilbert space ® F. 

The interaction between the qubit and the electromagnetic field is studied 
in the weak coupling limit [Gou06] , [Gou05] , [AFL90] . This means that in the 
interaction picture the unitary dynamics of the qubit and the field together 
is given by a quantum stochastic differential equation (QSDE) in the sense of 
Hudson and Parthasarathy [HP84] 

dUt = {cr-dA\ - a+dAt - \a+CF-dt\Ut, (5.1) 













2 


' = (o 





and Uq = 1. The operators C7_ and ct-j- are the annihilator and creator on the 
two-level system. The field annihilation and creation processes are denoted Af, 
and A\, respectively. Keep in mind that the evolution Ut acts nontrivially on 
the combined system ® whereas a± and At are understood to designate 
the single system-operators cr± 1 and 1 (g) A*. Throughout this chapter we 
will remain in the interaction picture. Equation (5.1) should be understood as 
a shorthand for the integral equation 

Ut = l+ I <j_UrdA\ - I a+UrdA^-l [ a+a^Urdr, 
Jo Jo Jo 

where the integrals on the right-hand side are stochastic integrals in the sense of 
Hudson and Parthasarathy [HP84], and Picard iterations can be used to show 
that (5.1) has a unique solution. The value of these integrals does not lie so 
much in their actual definition (see section 2.3.8), but in the Ito rule satisfied 
by them, allowing for easy calculations. The following is a slight extension of 
theorem 2.23. 

Theorem 5.1 (Quantum Ito rule [HP84], [Par92]) Let Xt andYt be quan- 
tum stochastic integrals of the form 

dXt = CtdAt + DtdA\ + Etdt 
dYt = FtdAt + GtdA\ + Htdt 

for some stochastically integrable processes Ct, Dt,Et, Ft, Gt and Ht (see [HP84], 
[Par92] for definitions). Suppose that Xi has an adjoint process, and that all 
the necessary products exist and are stochastically integrable. Then the process 
XtYt satisfies the relation 



d{XtYt) = XtdYt + {dXt)Yt + dXtdYt, 
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where dXfdYt should be evaluated according to the quantum ltd table: 



i.e. dX.dYf = CfGfdt. 





dAt 


dAl 


dt 


dAt 





dt 





dAl 











dt 












, the process f{Xt) satisfies 
(dXt)"^ should be evaluated 



As a corollary we have that, for any f G 
d{f{Xt)) = f'{Xt)dXt + y"{Xt){dXtf, where 
according to the quantum Ito table. 

Let us return to equation (5.1). In order to illustrate how the quantum 
Ito rule will be used, we calculate the time evolution on the qubit explic- 
itly. Wc choose an orthonormal basis of C^, and identify the algebra ,B(C^) 
of qubit-observables with M2(C), the algebra of 2 x 2- matrices. The algebra 
of observables in the field is given by B{J^), the bounded operators on J^. If 
id : M2(C) Af2(C) is the identity map and <f) : B{I') C is the ex- 
pectation with respect to the vacuum state := 1®0®0©... € (i.e. 
(/){¥) := (O I yO)), then time evolution on the qubit Tt : M2(C) M2(C) is 
given by Tt{X) := id (S) (j){U}X (g) lUt). On the combined system, the full time 
evolution jt : M2(C) (g) B{J^) -> M2(C) ® B{J^) is given by jt{W) := U^WUt. 
In a diagram this reads 



M2(C) 



M2(C)(g)B(J-) 



Tt 



M2(C) 



(5.2) 



M2(C)(g)B(J-). 



In the Schrodinger picture the arrows would be reversed. A qubit-state p would 
be extended with the vacuum to p 0, time evolution would change it into into 
p (?) (t){Uj ■ Ut), and in the last step the partial trace over the field would be 
taken, resulting in the state poTf. 

Using the Ito rule we can derive a (matrix- valued) differential equation for 
Tt{X) as follows. We start by using the Ito rule in order to write 



dTt{X) = id O ^{d{U}x O lUt)) 

= id O (l>{{dUl)X lUt + Ulx l{dUt) + {dUl)X l{dUt)) . 



(5.3) 



We then use the fact that stochastic integrals with respect to dA^ and dA^ 
vanish with respect to the vacuum expectation, leaving us only with the dt 
terms. The QSDE for u} easily follows from (5.1); 

dUl = ul^a+dAt - a-dA\ - \a+a-dt^, ul = 1. 
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The first term in equation 5.3 can then be written 

id <p{dU} X (g, lUt) = id i»(t){U}{a+dAt-a-dAl-^a+a-dt}Xi»lUt) 
= id<» (t){Ul{- la+a_dt}X i^lUt) 
= Tt{-l(T+a-X)dt. 

Similarly, the second term yields 

id(S)(j)iUlx (S)ldUt) =Tt{-lXa+a-)dt. 
For the third term, we use the quantum Ito table to see that 

Ul{a+dAt - cF-dA\ - \cF+a-dt]X (g) l{(T-dA\ - a+dAt - \a+cF-dt)Ut 
equals U}{a+Xa- (g) l)Ut dt, so that 

id ® (t){dU} X(^l dUt) = Ti{a+Xa^)dt . 
Combining these three terms, equation (5.3) becomes 

|t,(X)=T,(L(X)), (5.4) 

with L is the Lindblad generator 

L{X) := -^{a+a-X + Xa+a-) + a+Xa-. 

The matrix- valued differential equation (5.4) with initial condition Tq{X) = X 
is solved by Tt{X) = exp{tL){X), which is exactly the time evolution of a 
two-level system spontaneously decaying to the ground state, as it should be. 
(Compare this to the discussion in section 4.5.4, with zero field strength J7.) 
Although the arguments above are completely standard (cf. [HP84]), they do 
illustrate nicely and briefly some of the techniques used also in following sections. 



5.3 Quality of Information Transfer 

Now suppose we do a homodyne detection experiment, enabling us to measure 
the observables Ai + A\ in the field continuously in time [Bac90]. If initially the 
qubit is in state p, then at time t the qubit and field together are in a state p* 
on M2(C) O B(J") given by p\W) := p (t>{UjWUt) = p{id O (l){UlWUt)) . We 
flnd p*(l (g) {Af + A\)) by solving the differential equation 

d (id (g (/.(C/th(g( At +A| )[/()) = id(g</.(d([//l® (At + 4)C/t)) (5.5) 

= id® (o-_ +(T+)(g lt/t)di (5.6) 
= exp{tL){a^ + a+)dt (5.7) 
= e'^a^dt. (5.8) 
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In order to go from (5.5) to (5.6), we used once again the fact that the ground 

state kills all terms with dAf and dAl ■ Using the quantum ltd table to isolate the 
terms with dt, we then expand d{U^l^{At + Al)Ut) into six terms, three of first 
order and three of second order. The dt-terms coming from dU}l(E)iAt+Al)Ut, 
U}l(^{At+Al)dUt and dU}l(>~){At+Al)dUi cancel each other out, and obviously 
U}l(g){dAt+dAl)Ut lacks terms with dt. We arc left with dUll(E){dAt+dAl)Ut 
and U}l «) {dAt + dAl)dUt, which contribute Ul{{a+ + a-) (^l)Ut. 

This shows that regardless the initial state p of the qubit, the expectation 
of {At + AI) in the final state p* will equal the expectation of (2 — 2e~^)ax in 
the initial state p. 

5.3.1 Defining the Quality of Information Transfer 

The process at hand is thus a transfer of information about cTx to the 'pointer' 
{Ai + a]), which can be read off by means of homodync detection. Wc re- 
call definition 4.3 of unbiased measurement in the particular case of a qubit 
measurement by means of the electromagnetic field. 

Definition 5.1 (Unbiased Measurement) Let X be an observable of the qu- 
bit, i.e. a self-adjoint element of M2(C), and let Y be an observable of the 
field, i.e. a self-adjoint operator in (or affiliated to) B{F). Then an unbiased 
measurement M of X with pointer Y is by definition a completely positive map 
M : B{J^) M2(C) such that M{Y) = X. 

Needless to say, for each fixed point t in time, the map M : B{T) M2(C) 
defined by M{B) := id ® (/>(?// 1 ® BUt) is a measurement of with pointer 
F = (2 — 2e~i)~^{At +^t)- This means that, after the measurement proce- 
dure of coupling to the field in the vacuum state and allowing for interaction 
with the qubit for t time units, the distribution of the measurement results of 
the pointer Y has inherited the expectation of ax, regardless of the initial state 
p. However, we are more ambitious and would like its distribution as a whole 
to resemble that of <Jx. Wc recall from section 4.2.2 that the performance of an 
unbiased measurement is gauged by its maximal added variance E^. We will 
call its square root S the quality of measurement. 

Definition 5.2 (Quality) Let M : B{F) — )• M2(C) be an unbiased measure- 
ment of X with pointer Y. Then its quality S is defined by 

:= sup {Var^oM(r) - Wa.Vp{X)\ p e 5(Af2(C))}, 

where 5(M2(C)) denotes the state space o/M2(C) (i.e. all positive normalized 
linear functionals on M2(C)J. 

This means that is the variance added to the initial distribution of X by the 
measurement procedure M for the worst case initial state p. As we have seen. 



Ya.TpoM{Y) - Var^(X) = p{M{Y^) - M{Y f), 
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which impUes that = ||M(y2) _ M{Y)'^\\, where X H> \\X\\ denotes the 
operator norm on M2(C). In particular this shows that is positive, as one 
might expect. It follows from theorem 4.4 that S equals zero if and only if the 
measurement procedure M exactly carries over the distribution of X to Y. In 
short, E is a suitable measure for how well M transfers information about X to 
the pointer Y. 



5.3.2 Calculating the Quality of Information Transfer 

Let us return to the example at hand, i.e. M{B) = id® 4>{ull ® BUt), with 
field-observable Y = {2 — 2e~i)~^{At + A^^) as a pointer for Gx- Let us calculate 
its quality, which amounts to evaluating M{Y'^) {2-2e-iy^M{{Ai+ A\Y). 
To this aim, we will first introduce some ideas which will be of use to us in later 
calculations as well. 

Definition 5.3 Let f and h be real valued functions, h twice differentiable. Let 
Yt be given by dYt = f{t){dAt + dA\), Yq = 0. For X e M2(C) we define 

Fh{X,t) := id®cj){ulX®h{Yt)Ut). 
When no confusion can arise we shall shorten Fh{X,t) to Fh{X). 

The homodyne detection experiment has given us a measurement result (the 
integrated photocurrent) which is just the path of measurement results of A^+aI 
continuously in time. Given this result, we post-process it by weighting the 
increments of the path with the function f{t) and letting h{y) act on the result. 
The following lemma will considerably shorten calculations. 



Lemma 5.2 

dFhjX) 
dt 



= Fh{L{X)) + f{t)Fh^{a+X + XaJ) + \f{tfFh'\X) 



Proof. Just as we did in equation (5.5), we use that the vacuum expectation 

kills all terms with dAi and dA*i. We then expand d{U} X h(Yi)Ui) into three 
first order and three second order terms. Using theorem 5.1 we see that after 
the vacuum expectation, the terms {dUj)X (g) h{Yt)Ut, Ujx (g) h{Yt)dUt and 
{dUj)X h{Yt)dUt make up Fh{L{X))dt. From 

dh{Yt) = h\Yt)f{t){dAt + dAl) + \h"{Yt)f{tfdt, 

we find that, after taking vacuum expectations, the terms d{U})d{X h{Yt))Ut 
and Ujd{X (E) h{Yi))dUi make up the second term f{t)Fiii {d+X + Xa-)dt, and 
U}d{X h{Yt))Ut provides the last term \f{tf Fh'.{X)dt. □ 
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We are now well-equipped to calculate M(^{At + ^t)^)- Choose f{t) = 1 and 



h{x) 



M{{At+Al)^ 



(The maps x n- .t" will be denoted X" hereafter.) Then we have 



dt 



Fx'i{\) and by lemma 5.2 
2F^{a-+a+) + Fi{\) = 2F^{a-+a+) + l, F3e=(l,0)=0. (5.9) 



|F^((T_+a+) + 2Fi(a+a_), Fy,{a_+a+,Q) = Q. (5.10) 



Applying lemma 5.2 to Fx{cf- + cr+), we obtain 
dFx{a- + cr+) _ 

It " "2 

Finally, -Fi(tT+(7_) satisfies 
dFi{a+a-) 



dt 



= -Fi{a+a. 



Fi{a+a-,0) = a+a-. 



(5.11) 



Solving (5.11), (5.10) and (5.9) successively leads first to Fi((T+cr_) = e~*cr+cr_, 
then to Fx{<J- + (j+) = 4(e^2 — e^*)(T+(T_ and finally to Fx^il) = 8(e~2 — 
l)2(7+cr_ + t/. Consequently, the quality of the measurement M of with 
pointer Y = {2 — 2e~^)~^{At + AJ) is given by 

= ||M(F2) _M(y)2|| 

8(e-i - 1)V+CT_ +t/ 



(2-2e-i)2 



2(7+ cr_ + 



t 



t 



(2-2e-5)2 
+ 1. 
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(2-2e-i)2 

This expression takes its minimal value 2.228 a.t t = 2.513, leading to a 
quality S = 1.493. 

The calculation above has an interesting side product. The observable 
M(^{At + AI)"^) depends linearly on cr^, indicating that in addition to infor- 
mation on ax, also information on cr^ in the initial qubit-state ends up in the 
measurement outcome. Indeed, if we use as a pointer 



~ {At+Air-ti _ ^ 

4(e-3-l)2 



(5.12) 



then we have M{Y) = u^, so that M is also a measurement of with pointer Y. 

Note that the pointers Y and Y commute, i.e. measuring A^ + aI via the 
homodyne detection scheme is an indirect joint measurement of ax and az- If we 
would also like to gain some information about ay, we could for example sweep 
the measured quadrature through [0, 2tt) in time by measuring e'"*At + e~'"*^J 
instead. We will however restrict ourselves to continuous time measurement of 
At + AI, as additional information on ay would deteriorate the quality of ax- 
and/or cr^-measurement. We formulate the joint measurement theorem 4.7 in 
the particular case of qubit measurement by means of an electromagnetic field. 
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Theorem 5.3 (Joint Measurement) Let M : B{F) — >■ M2(C) he an unbi- 
ased measurement of self-adjoint ohservahles X G M2(C) and X G M2(C) with 
self-adjoint commuting pointers Y and Y in (or affiliated to) B{J-), respectively. 
Then for their corresponding qualities S and S the following relation holds 

2Et > \\[X,X]\\. 

Denote by S the quality of the measurement with the pointer Y defined in 
(5.12). Since [a-a;,^'^] = —2iay, the quaUties S and S (corresponding to the 
pointers Y and Y, respectively) satisfy the inequaUty 

EE > 1. (5.13) 

Using similar techniques as before, that is recursively calculating ^3^4(1) via 
lemma 5.2, we find 

-2 _ t^ 2t-A{e-i -If 

~ 8(e-l - 1)4 ^ (e-i - 1)2 

This expression takes its minimal value 8.836 at t = 2.513. This leads to a 
quality S = 2.973, which means that EE = 4.437, i.e. wc arc far removed from 
hitting the bound 1 in (5.13). However, there is still some room for manoeuvring 
by post-processing of the homodyne measurement data. 

5.4 The Weighted Path 

Let us presently return to our homodyne detection experiment. We observe 
At + j4|. continuously in time, i.e. the result of our measurement is a path 
Lo of measurement results uJt (the photocurrent integrated up to time r) for 
Ar + AX. This means that we have a space Q of all possible measurement 
paths and that we can identify an operator Ar + A\ with the map from O to M 
mapping a measurement path a; G to the measurement result lo^- at time r. 
That is, we have simultaneously diagonalized the family of commuting operators 
{At -\- A\\ t > 0} and viewed them as random variables on the spectrum fi. 
The spectral projectors of the operators {At + A^] < t < t} endow ft with 
a filtration of cr-algebras {Ej ; t G Furthermore, the states p"^, defined by 

p^iW) :— p(E)(f)(UlWUr) provide a family of consistent measures P,. on (O, E,-), 
turning it into the probability space (f2,E(,P). (See e.g. [Bou04].) 

We aim to find random variables on (fi, Et,P) having distributions resem- 
bling those of ax and in the initial state p. In the previous section we used 
the random variables 

yM = ^ and y(a;) = /I'r ^ - 1, r = 2.513 (5.14) 

2 — 2e 2 4(e 2 — 1)^ 

for (Tj, and cr^, respectively. Our next goal is to find the optimal random vari- 
ables, in the sense of the previously defined quality. 
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5.4.1 Restricting the Claiss of Pointers 

In our specific example, M is given by M{B) = id (8) (f>{Ull (g) BUr)- Note that 
stochastic integrals with respect to the annihilator Ar acting on the vacuum 
vector are zero. Therefore, we can modify Ur to Zt, given by 



without affecting M [Blv92a]. Therefore, for all B e S(J"), we have M{B) = 
\d®(l){Ul\®BUr) = id O (j){Zl\ O BZr). The solution Zt can readily be found, 
it is given by 



Note that Zt, asa matrix valued function of the measurement path, is an element 
of M2(C) ® Ct, where Ct is the commutative von Neumann algebra generated 
by At + A^., < T < t. Moreover we see that Zt is not a function of all the 
{At- + AD's separately, it is only a function of the endpoint of the weighted 
path Yt = /o e-^'^{dAr + dAl) [Wis96]. Therefore if we define St C Ct to be 
the commutative von Neumann algebra generated by Yt, then we even have 
Zt e M2{C)(^St. 

Denote by C E[C|<St] the unique classical conditional expectation from 

Ct onto St that leaves (/> invariant, i.e. (/)(E[C \Si]) = (f>{C) for all C G Ct- We can 
extend E[ • \St] by tensoring it with the identity map on the 2x2 matrices to 
obtain a map id0E[ • \St] from M2(C)0Ct onto M2(C)0»St. From the positivity 
of E[ • \St] as a map between commutative algebras, it follows that id(8)E[ • \St\ 
is completely positive. Since E[ • \St] satisfies E[C5|iSf] = E[C|iSt]S' for all 
C GCt and S G St, we find that id (g) E[ • \St] satisfies the module property, i.e. 



for all Ai,A2 £ M2{C) (g) Si and B e M2(C) (g Ct- Moreover, if p is a state on 
M2(C), then it follows from the invariance of </> under E[ • |iSf] that id(8)E[ • \St] 
leaves p®(f) invariant. We conclude that, given p on M2(C), the map id0E[ • \St] 
from M2(C) ® Ct onto M2(C) (g) St is the unique conditional expectation in the 
noncommutative sense of [Tak71] that leaves p ^ (j) invariant. We will use the 
shorthand Es^ for id (g) E[ • |<S(] in the following. 

Lemma 5.4 Let C G Ct be a pointer with quality such that M{C) — X. 
Then C := E[C \St] is also a pointer with M{C) = X, and its quality is at least 
as good; E^j < Ec- 



dZr = \ a-{dAl + dAr) - ^a+a-dr \Zr, Zq = 1 




id«)E[- \St]iAiBA2) = Ai(id®E[ ■ \St]{B)^A., 



■2, 
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Proof. Note that for all states p on M2(C) we have 

p{M{C)) = p® (j){z}l (g) CZ, 



p(^0(z/E5,(l®C)Zt) 



zll(g)CZt 



= p®(t>{Zll®CZt) 
= P{M{C)). 

In the third step, we used the module property and the fact that Zt is an element 
of M2(C) (g) St, and in the fourth step, we used invariance of p® (p. Since this 
holds for all states p on Af2(C), we conclude that M{C) = M{C) = X. 

As for the variance, we note first that the conditional expectation E^^ is 
a completely positive identity preserving map. Therefore, for all self-adjoint 
C e Ct, we have 

E5,(1«)C") > (e5,(1«)C))^ 

by proposition 4.1. 

We can now apply the same strategy as before. For all states p on M2(C) 
we have 

p{M{C^)) = p®(l,{Z\l®C^Zt) 

= p ® 4,(¥.s,{zll ® C"^ Zt 



> 



= p(^^(^ZlEs,{l®C^)Zt 

(4(E5,(l®C))'Zi) 



Thus M(C2) > M(C'2), so that M(C2) - M{Cf > M{C'^) - M{Cf, and in 
particular > S^. □ 

This has a very useful consequence: if we arc looking for pointers that record, 
say, ax or in an optimal fashion, then it suffices to examine only pointers 
in St- Instead of sifting through the collection of all random variables on the 
measurement outcomes, we are thus allowed to confine the scope of our search to 
the rather transparent collection of measurable functions of Yf. In the following, 
we will look at such pointers ht{Yt). We will usually drop the subscript t on h 
to make the notation lighter. 

5.4.2 Distribution of Yt 

At this point we are interested in the probability distribution of the random 
variable Yt. Its characteristic function [BL85] is given by 

E{k) := Ept [exp{-ikYt)] = p0 ® exp{-ikYt)Ut^ = p {F,^p^_,kx)W) , 
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so that we need only calculate -F'exp(-ife3e)(l)- For notational convenience we 
will replace the subscript exp(--iA:X) by k in the following. Using lemma 5.2, 
we find the following system of matrix valued differential equations: 



d 
dt 
d 



d A;^e~* 

— Ffc(a+cr_) = -Fk{a+C7_) —Fk{a+a_), 

with initial values ^^(1,0) = 1. Fk{(Tx,0) = and i^fe((T+(T_,0) = cr+(T_. 
(Recall that = C7+ + C7_.) Solving this system leads to 

Ffc(l) = e- ''''^^ j 1 _ - e-*)a^ -k'^{l- e"*) ) . 



We define the Fourier transform to be F{f){x) := f{k)e^^^dk. Then 

the probability density of Yt with respect to the Lebesgue measure is given by 
^-^(^)(^) = -^p(t{F^{X)){x)). Defining := -^=^(^^(1)) (x), we 
can write 



i.e. Y[ is distributed according to a Gaussian perturbed by the matrix elements 
of the initial state pia^) and p(cr+fT_) = \p{az) + \- No information about p 
on Uy enters the distribution though. To gain information about <jy we would 
have to change our continuous time measurement setup, as we discussed before. 
If we absorb a constant (1 — e~*)~2 in the definition of Yj, i.e. if we redefine 
It := (1 — e^')^5 J^' e^^idA^ + dA]^), then its density becomes 

-i- 

p{y) = 7^ (i + + HKv^ - 1)^+^^-) , (5.15) 



/27r 

where := \/l — e~*. 



5.5 Variational Calculus 

In lemma 5.4, we have shown that it suffices to consider only random variables of 
the form h{Yt) for some measurable h. In equation (5.15), we have captured the 
probability distribution of Yj. All that remains now is to calculate the optimal 
h, which can be done with variational calculus. 
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5.5.1 Optimal Mecisurement of 

We seek the function h* for which the quality S of the pointer for Ox 

measurement is optimal. In other words, we need 



:= 



/OO . f-OO 

h'^iy)piy)dy- ( h{y)p{y)dy 
-OO J — OO 



di 





d2 



(5.16) 



to be minimal under the restriction h{y)p{y)dy = a^. 

Now E^ is the norm of a diagonal 2 x 2-matrix with entries di and d2- 
Both depend smoothly on h, but = max{di,d2} does not. There are three 
possibilities: 

I) = di in some open neighbourhood of h* . To find these h* , we must 
minimize the smooth functional di and then check whether di < d2. 

II) E^ = d2 in some open neighbourhood of h* . To find these h* , we must 
minimize d2 and check whether c?2 < di. 

Ill) di — d2 for h*. To find these h*, we must minimize di subject to the 
condition di = d2- 

In principle, we need three difi^erent functionals Ai, A2 and A3 for these three 
distinct cases. However, it turns out that we can make due with the following 
functional 

(1 12 
-j= / h?{y)e-'^y^dy-l) + 
\/ Ait J -00 ' 

r h'{y){y^-l)e-'^y'dy) 

a 00 
h{y)e-^y^dyj + 

\ h{y){y^-l)e-'^y'dy) 

J —00 



+ 



(5.17) 



72 



73 h{y)ye-'^y"dy-l). 



The constants 71 , 72 and 73 are the Lagrange multipliers enforcing the identity 
j'^^h{y)p{y)dy = g^. These are needed in all cases: Ai, A2 and A3. One 
can readily check that setting k = in A yields Ai, setting k = 1 yields A2 
and considering Kj clS cl free Lagrange multiplier forces di = d2, so that one has 
A3 = A. 

All three cases lead to similar optimality conditions. The requirement that 
the optimal solution be stable under first order variations yields h* satisfying 
either 

h*{x)= ^\l^^' +Cs (5.18) 



or 



h*{x) 



C^x^ + C^x + Ce 



(5.19) 
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for some real constants Ci, C2, C3, C4, C5, Cq and e depending on k, 71, 72, 73- 
Suppose that h* takes the form (5.19). The constraint h* {y)p{y)dy = Ox 

will then force C4 = Cg = and C5 = so that h*{y) = Pf^y. The 

random variable we are investigating is simply the observed path, weighted by 
the function /(r) = f3^^e^'^^^, with t the final time of measurement. Since 
all the integrals we encounter are Gaussian moments, we can readily compute 
M{h*^Yt)) = J^^h*^y)piy)dy to be 2a+a- + p^H. Thus 

= ||(2a+a_ + P^H) - (a_ + a+ fW = 1 + . 

For t — )• 00, this amounts to S — )• \/2- Already, we have improved on the naive 
result S = 1.493 obtained previously. 

We proceed with the more involved case (5.18), which will provide us with 
the optimal solution. Before we continue with the constants C-i,C2,Cs and e 
however, we calculate some integrals for later use. 

Definition 5.4 Define the error function erf (a;) and integrals I{e) and J{e) by 



erf(a;) := / e " rfu, /(e) := / — dx, J(e) := I 



(x2 + ey 



:dx. 



Lemma 5.5 



Proof. Since the Fourier transform of e v^l*^! is equal to y ^^sqijj we find 

7(e) = Jf / J^(e-v^l'=l)j^(e-V)rfa; 

V J -00 

V 2e 

in view of the unitarity of the Fourier transform. Then 

/(e) = r e-^'^e^'^dk 



e 2 



£ V VV2 



The expression for J then follows from 







dx = ^-p: e 2 ax 



, dx \x'^+ej J_^ \x'^ + e (a;^ + e)^ 

/oo / ^ 2e\ 2 

^ (^-1 + ^ + ^^^^ j e-^d. = + (e - l)/(e) + 2eJ(e). 



□ 



Ill 



The condition h*{y)p{y)dy = a- + a+ = implies 



Cl = -, — : ^, C2 = C3 = 

which fixes Ci as a function of e. The next step is to express di and ^2 in terms 
of s: 

Cl_ p 



(/(£)- £j(£))-l, 



(V2^ - (1 + 2e)I{e) + e{l + e) J(e)) + ^2- 

First, we use lemma 5.5 to express the above in terms of elementary functions 

and the error function. Then, using Maple, we find that e 1—). max{(ii, ^2} has 
a unique minimum at £ = 0.605, for which d\ = d2 = 0.470. This leads to a C\ 
that equals 2.359, and to a quality of 

S = ^/mB3^{dl,d2} = 0.685 . 
5.5.2 Optimal Me£isurement of 

For optimal cr^-measurement. we can run the same program. We search for the 
function h that optimizes the quality S, under the restriction that h{Yt) be a 
pointer for cr^-measurement. That is, we search for a function h minimizing the 
functional of equation (5.16), but now under the restriction h{y)p{y)dy = cTz- 
Again there are three cases of interest, d\_ = d^, di > d2 and d2 > di, which 
we can treat simultaneously by introducing, analogous to equation (5.17), the 
functional 

, 1 f'°° 12 \ 

A(/i,K,7i,72,73) := / h'^iy)e-'^y'dy-l) 

^{^fj'{y){y'-l)e-y'dy) 
(-= / h{y)e-'^y'dy + l) 

, o2 roo 

+ 72 h{y){y' - l)e-'^y dy - 2) 
+ 73 ( / h{y)ye-^y^dy). 



+ 
+ 7i 



Indeed, 71,72 and 73 are the Lagrange multipliers enforcing the restriction 
/oo ^{y)p{y)dy = <^z- Again, the functional Y? of equation (5.16) depends 
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nondifferentiably on h when di = d2- We then have to search for the optimum 
among the points of nondifferentiability, in which case n is the Lagrange mul- 
tiplier confining us to these points. If di > d2 then k = 1 and if d2 > di then 
K = 0. Summarizing, wherever A takes its minimal value, optimality implies 
^(/i, K, 7i, 72, 73) = for some «;,7i,72 and 73. Performing the functional 
derivative yields either 

Mx) = ^4±^+I?3 (5.20) 

or 

h{x) = Dix"^ + D5X + De (5.21) 

for some (time-dependent) constants Di, D2, Ds, D4, D5, De and S depending 
on K,7i,72 and 73. 

Again, we begin with the least demanding case (5.21), resulting from k = Q. 
The condition h(y)p(y)dy = implies = 0, 1)4 = ji^'^ and Dq = 
— 1 — /3|. For t — ^ 00, this leads to 

E2 = \\M{h\Yt))-M(h{Yt)f\\ 
= ||(4a+a_+31)-l|| 
= 6, 

so that S — >■ \/&. 

This improves the result S = 2.973 obtained previously, but once again the 
ultimate bound will be reached in the more arduous case (5.20). There, the 
condition h{y)p{y)dy = implies 



V27r + I{5)D2 
J->s = 



/27r 
2V27f 



= Di{V2^ - 6I{S)) . 

This leads to expressions for di and d2 as a function of S. Using lemma 5.5 

and Maple once more, we find that the function S h- > maxjcJi, ^2} has a Tinique 
minimum at (5 = 2.701, for which di = d2 = 2.373. This leads to a quality of 

E = ^/max{di,d2} = 1.540, 

attained for Di = 0, D2 = —21.649 and D3 = 5.391. For the joint measurement 
this leads to 

ES = 1.056 . 

Although we did not achieve the bound of 1 provided by theorem 5.3, we have 
come as close as the measurement setup allows. We conclude that, using the 
setup investigated in this chapter, no simultaneous measurement of ax and CTz 
will be able to approach the quantum bound by more than 5.6 %. Furthermore, 
we have identified the unique pointers for this optimal measurement in equations 
(5.18) and (5.20). 
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5.6 Distribution of Pointer Variables 

We have designed pointers h*{Yt) and h{Yt) in such a way that their distribu- 
tions in the final state best resemble the distributions of and in the initial 
state. We will now calculate and plot these final densities. 



5.6.1 Calculation of h*- and /i-Densities 

Let p be the initial state of the qubit and let it be parameterized by its Bloch 
vector {Px,Py,Pz)- By equation (5.15), the density q{y) of Yt is given by 

Q{y) = P{p{y)) = ^ (i + PtyP. + ^Uv' - 1)^^)- (5-22) 



We are interested in the the distributions r{x) and s{x) of h*{Yf) and h{Yt) 
respectively. Let us start with h*. Prom equation (5.18), we first calculate the 
points y where h* (y) = x for some fixed value of x. 

y± = ^ 

2x 

By the Frobenius-Peron equation (see e.g. [Ott93]), r{x) is given by 

which leads immediately to 

^ ivl + + Al/±P. + A'(l/i - 1)^) 



+ ,- 

where it is understood that r(x) ^ only for x S ^fe]' ^ similar 

analysis for s{x). The points y in which h{y) = x are given by 



y± = ±\ 

This leads to 



'{x-Ds)S-D2 
Ds-x 



{yl + 5 2 (1 + I3,y^p^ + ^2 y2 _ 1 i^A 

<x) = Y ^ , ^ , (5.24) 



with s{x) 7^ only for x G + ^,0^]. We proceed with a graphical illustra- 
tion of the results obtained so far. 
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5.6.2 Plots of (Tj;- Measurement 

According to formula 5.22, the distribution of the endpoint of the weighted path 

depends on the input qubit-state. For instance, the negative (Jaj-eigenstate, the 
tracial state and the positive a^j-eigenstate lead to the distributions below: 




-5 -4 -3 -2 -1 1 2 3 4 5 

y 





Fig. 15: Probability den- 
sity of the endpoint of the 
weighted path for the input 
state I — >■ ) . 



Fig. 13: Probability den- 
sity of the endpoint of the 
weighted path for the input 
state I ) . 



Fig. 14: Probability den- 
sity of the endpoint of the 
weighted path for the tra- 
cial input state. 



In order to estimate ax, we use the pointer of ax 

given by 

h*{x) = (5.25) 



+ e 

with e = 0.605 and Ci = 2.359. It is illustrated 

to the right. In formula (5.23), we have calculated 
the probability distribution of this pointer under 
the distribution of the endpoint of the weighted 
path. The distribution of the weighted path is 
illustrated above, that of the pointer below. 




Fig. 16: Pointer for ax 
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Fig. 17: Probability density 
of the (Ti,-pointer for the in- 
put state I <— } . 



Fig. 18: Probability density 
of the CTx-pointer for the 
tracial input state. 



Fig. 19: Probability density 
of the (Tii-pointer for the in- 
put state I — > ) . 
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5.6.3 Plots of (T2;- Measurement 

We repeat this for the cr^-pointer. By formula 5.22, the positive (T;j:-eigenstate, 
the tracial state and the negative cr^-eigenstate lead to the distributions of the 
endpoint of the weighted path that are shown below: 





-5-4-3-2-10 1 2 3 4 5 




Fig. 20: Probability den- Fig. 21: Probability den- Fig. 22: Probability den- 
sity of the endpoint of the sity of the endpoint of the sity of the endpoint of the 
weighted path for the input weighted path for the tra- weighted path for the input 
state It). cial input state. state 14-). 




Fig. 23: Pointer for c. 



In order to estimate cr^ , we use the pointer of 
illustrated here to the left. It is given by 



h(x) = 



Do 



8 



(5.26) 



with 5 = 2.701, L»2 = -21.649 and D3 = 5.391. 
From formula (5.24), we read off the probability 
distributions of this pointer under the distribu- 
tions of the endpoint of the weighted path. The 
probability distributions of the weighted path are 
illustrated above, those of the pointer below. 




I I I I I I I I I r I 



I I I I I I I I ITI I I I 



Fig. 24: Probability density 
of the <Tz-pointer for the in- 
put state I t). 



Fig. 25; Probability density 
of the (T^-pointer for the tra- 
cial input state. 



Fig. 26: Probability density 
of the <Tz-pointer for the in- 
put state I 4-). 
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5.7 Discussion 

In this chapter, we have investigated homodyne detection of spontaneous de- 
cay of a two-level atom into the electromagnetic field. We have seen how the 
photocurrent, besides carrying information on Ux (which is immediate from 
the innovations term in the filtering equation), also carries information on Gz- 
Homodyne detection can thus be viewed as a joint measurement of the non- 
commuting observablcs a^, and cr^ in the initial state of the qubit, and we have 
identified the optimal pointers for this procedure in equations (5.25) and (5.26). 

One particular feature of the pointers we constructed might seem counter- 
intuitive at first: they yield values outside [—1,1] with nonzero probability. 
This is a direct result of our requirement that the measurement be unbiased. 
Suppose, for example, that the input state is | f ), so that Oz has value 1. 
Since the photocurrent carries information on Ux as well, its information on az 
is certainly fiawed, and will yield estimates cr^ < 1 at least some of the time. 
Unbiasedness then implies that also estimates cr^ > 1 must occur. 

On the other hand, an unbiased measurement will yield on average the 'true' 
value of az for any possible input state. (Not just for the 3 possibilities sketched 
on page 115.) In repeated experiments, optimality of our pointers guarantees 
fast convergence to these averages. 

Theorem 5.3 provides a theoretical bound for the quality of joint measure- 
ment of Ox and g^- No conceivable measurement procedure can ever achieve 
EE < 1. It is now clear that this bound cannot be met by way of homodyne 
detection: a small part of the information extracted from the atom is simply 
lost in this particular procedure. Constructing the optimal pointers on the pho- 
tocurrent does yield EE = 1.056 however, a figure much closer to the bound 
than the 4.437 provided by the naive choice of (5.14). 
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In this chapter, we propose an adaptive, two step strategy for the estimation of 
mixed qubit states. We show that the strategy is optimal in a local minimax 
sense for the trace norm distance as well as other locally quadratic figures of 
merit. Local minimax optimality means that given n identical qubits, there 
exists no estimator which can perform better than the proposed estimator on 
a neighbourhood of size n^^^^ of an arbitrary state. In particular, it is asymp- 
totically Bayesian optimal for a large class of prior distributions. 

We present a physical implementation of the optimal estimation strategy 
based on continuous time measurements in a field that couples with the qubits, 
not unlike the setup in the previous chapter. 

The crucial ingredient of the result is the concept of local asymptotic nor- 
mality (or LAN) for qubits. This means that, for large n, the statistical model 
described by n identically prepared qubits is locally equivalent to a model with 
only a classical Gaussian distribution and a Gaussian state of a quantum har- 
monic oscillator. 

The term 'local' refers to a shrinking neighbourhood around a fixed state po- 
An essential result is that the neighbourhood radius can be chosen arbitrarily 
close to n~^^^. This allows us to use a two step procedure by which we first 
localize the state within a smaller neighbourhood of radius n~^/^+'^, and then 
use LAN to perform optimal estimation. 



6.1 Introduction 

State estimation is a central topic in quantum statistical inference [IIol82, Hst76, 
BG+03, Hay05]. In broad terms the problem can be formulated as follows: given 
a quantum system prepared in an unknown state p, one would like to reconstruct 
the state by performing a measurement Af whose random result X will be used 
to build an estimator p{X) of p. The quality of the measurement-estimator pair 
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is given by the risk 



Rp{M,p)=E{d{p{X),pf), 



(6.1) 



where d is a distance on the space of states, for instance the fidelity distance 
or the trace norm, and the expectation is taken with respect to the probability 
distribution of X, when the measured system is in state p. Since the risk 
depends on the unknown state p, one considers a global figure of merit by either 
averaging with respect to a prior distribution n (Bayesian setup) 



An optimal procedure in either setup is one which achieves the minimiim risk. 

Typically, one measurement result does not provide enough information in 
order to significantly narrow down on the true state p. Moreover, if the mea- 
surement is "informative" then the state of the system after the measurement 
will contain little or no information about the initial state (cf. chapter 4) and 
one needs to repeat the preparation and measurement procedure in order to 
estimate the state with the desired accuracy. 

It is then natural to consider a framework in which we are given a number 
n of identically prepared systems and look for estimators Pn which are opti- 
mal, or become optimal in the limit of large n. This problem is the quantum 
analogue of the classical statistical problem [Vaa98] of estimating a parameter 
6 from independent identically distributed random variables Xi,. . . ,X„ with 
distribution ¥0, and some of the methods developed in this paper are inspired 
by the classical theory. 

Various state estimation problems have been investigated in the literature 
and the techniques may be quite different depending on a number of factors: the 
dimension of the density matrix, the number of unknown parameters, the purity 
of the states, and the complexity of measurements over which one optimizes. A 
short discussion on these issues can be foimd in section 6.2. 

In this paper we give an asymptotically optimal measurement strategy for 
qubit states that is based on the technique of local asymptotic normality intro- 
duced in [GK06, GJ07]. The technique is a quantum generalization of Lc Cam's 
classical statistical result [LC86], and builds on previous work of Hayashi and 
Matsumoto [Hay04, HM06]. We use an adaptive two stage procedure involv- 
ing continuous time measurements, which could in principle be implemented in 
practice. The idea of adaptive estimation methods, which has a long history 
in classical statistics, was introduced in the quantum set-up by [BNGOO], and 
was subsequently used in [CMOO, Hay02, HM05]. The aim there is similar: 
one wants to first localize the state, and then perform a suitably tailored mea- 
surement which performs optimally around a given state. A different adaptive 
technique was proposed independently by Nagaoka [Nag05] and further devel- 
oped in [Fuj06]. 




(6.2) 



or by considering a maximum risk (pointwise or minimax setup) 



Rm^^{M,p)= sup pRp{M,p). 



(6.3) 
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In the first stage, the spin components a^, (Jy and (T^ are measured sepa- 
rately on a small portion n <C ?t. of the systems, and a rough estimator /5„ is 
constructed. By standard statistical arguments (see lemma 6.1) we deduce that 
with high probability, the true state p lies within a ball of radius slightly larger 
than n~^/^, say with e > 0, centered at /5„. The purpose of the first 

stage is thus to localize the state within a small neighbourhood (up to a unitary 
rotation), as illustrated in the figure below using the Bloch sphere representa- 
tion of qubit states. 




Fig. 27: After the first measurement stage, the state p 
lies in a small ball centered at p„. 

This information is then used in the second stage, which is a joint measure- 
ment on the remaining n — n systems. This second measurement is implemented 
physically by two consecutive couplings, each to a bosonic field. The qubits are 
first coupled to the field via a spontaneous emission interaction and a contin- 
uous time heterodyne detection measurement is performed in the field. This 
yields information on the eigenvectors of p. Then the interaction is changed, 
and a continuous time homodyne detection is performed in the field. This yields 
information on the eigenvalues of p. 

We prove that the second stage of the measurement is asymptotically optimal 
for all states in a ball of radius n~^^'^~^^ around p„. Here rj can be chosen to 
be bigger than e > implying that the two stage procedure as a whole is 
asymptotically optimal for any state, as depicted in the figure below. 




Fig. 28: The smaller domain is the localization region 
of the first step. The second stage estimator is optimal 
for all states in the bigger domain. 

The optimality of the second stage relics heavily on the principle of local 
asymptotic normality or LAN, see [Vaa98j, which we will briefly explain below. 
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and in particular on the fact that it holds in a ball of radius n~^/^+'' around p„ 
rather than just rT^I"^ , as was the case in [GK06]. 

Let po be a fixed state. We parameterize the neighbouring states as /Ou/v«) 
where u = (u^, Uy, Uz) € is a certain set of local parameters around po- Then 
LAN entails that the joint state := p^J^ of n identical qubits converges in 
the limit n — > oo to a Gaussian state of the form iV"(g)(/)", in a sense explained in 
theorem 6.2. By N'^ we denote a classical one-dimensional normal distribution 
centered at ti^ ■ The second term 0" is a Gaussian state of a harmonic oscillator, 
i.e. a displaced thermal equilibrium state with displacement proportional to 
{ux,Uy). We thus have the convergence 

to a much simpler family of classical - quantum states for which we know how 
to optimally estimate the parameter u [Hol82, YL73]. 

The idea of approximating a sequence of statistical experiments by a Gaus- 
sian one goes back to Wald [Wal43] , and was subsequently developed by Le Cam 
[LC86] who coined the term local asymptotic normality. In quantum statistics 
the first ideas in the direction of local asymptotic normality for d-dimensional 
states appeared in the Japanese paper [Hay03], as well as [Hay04] and were 
subsequently developed in [HM06]. In theorem 6.2 we strengthen these results 
for the case of qubits, by proving a strong version of LAN in the spirit of Lc 
Cam's pioneering work. We then exploit this result to prove optimality of the 
second stage. A different approach to local asymptotic normality has been de- 
veloped in [GJ07] to which we refer for a more general exposition on the theory 
of quantum statistical models. A short discussion on the relation between the 
two approaches is given in the remark following theorem 6.2. 




n 



Fig. 29: Total spin representation of the state of n » 1 
spins. The quantum fluctuations of the x and y spin 
directions coincide with those of a coherent state of a 
harmonic oscillator. 

From the physics perspective, our results put on a more rigorous basis the 
treatment of collective states of many identical spins, the keyword here being 
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coherent spin states [HH74]. Indeed, it has been known since Dyson [Dys56] that 

n spin- 1 particles prepared in the spin up state | '|")'^" behave asymptotically 
as the ground state of a quantum oscillator, when considering the fluctuations 
of properly normalized total spin components in the directions orthogonal to z. 
We extend this to spin directions making an "angle" of order n~^/'^'^^ with the 
z axis, as illustrated in the figure above, as well as to mixed states. We believe 
that a similar approach can be followed in the case of spin squeezed states and 
continuous time measurements with feedback control [GSM04]. 

In theorem 6.3 we prove a dynamical version of LAN. The trajectory in time 
of the joint state of the qubits together with the field converges for large n to 
the corresponding trajectory of the joint state of the oscillator and field. In 
other words, time evolution preserves local asymptotic normality. This insures 
that for large n the state of the qubits "leaks" into a Gaussian state of the field, 
providing a concrete implementation of the convergence to the limit Gaussian 
expcrinicnt. 

The punch line of the paper is theorem 6.4 which says that the estimator 
Pn is optimal in local minimax sense, which is the modern statistical formula- 
tion of optimality in the frequentist setup [Vaa98]. Also, its asymptotic risk is 
calculated explicitly. 

The paper is structured as follows: in section 6.2. we show that the first 
stage of the measurement sufficiently localizes the state. In section 6.3, we 
prove that LAN holds with radius of validity n~^^^~^^ , and we bound its rate 
of convergence. Sections 6.4 and 6.5 are concerned with the second stage of the 
measurement, i.e. with the coupling to the bosonic field and the continuous 
time field-measurements. Finally, in section 6.6, asymptotic optimality of the 
estimation scheme is proven. 

The technical details of the proofs are relegated to the appendices in order 
to give the reader a more direct access to the ideas and results. 

6.2 State Estimation 

In this section we introduce the reader to a few general aspects of quantum 

state estimation after which we concentrate on the qubit case. 

State estimation is a generic name for a variety of results which may be 
classified according to the dimension of the parameter space, the kind or family 
of states to be estimated and the preferred estimation method. For an introduc- 
tion to quantum statistical inference we refer to the books by Helstrom [Hst76] 
and Holevo [Hol82] and the more recent review paper [BG+03]. The collec- 
tion [Hay05] is a good reference on quantum statistical problems, with many 
important contributions by the Japanese school. 

For the purpose of this chapter, any quantum state representing a particu- 
lar preparation of a quantum system, is described by a density matrix (positive 
self-adjoint operator of trace one) on the Hilbert space H associated to the sys- 
tem. The algebra of observables is B{H), and the expectation of an observable 
a G B{T-L) with respect to the state p is tY{pa). A measurement M with out- 
comes in a measurable space {X, S) is completely determined by a cr-additive 
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collection of positive self-adjoint operators M{A) on H, where A is an event in 
S. This collection is called a positive operator valued measure (cf. def. 2.6). 
The distribution of the results X when the system is in state p is given by 
Pp(^)=tr(pM(A)). 

We are given n systems identically prepared in state p and we are allowed 
to perform a measurement whose outcome is the estimator /5„ as discussed 
in the introduction. 

The dimension of the density matrix may be finite, such as in the case of 
qubits or d-levels atoms, or infinite as in the case of the state of a monochro- 
matic beam of light. In the finite or parametric case one expects that the 
risk converges to zero as n^^ and the optimal measurement-estimator sequence 
{Mn,Pn) achieves the best constant in front of the n^^ factor. In the non- 
parametric case the rates of convergence are in general slower that because 
one has to simultaneously (^stiniat(^ an infinite number of matrix elements, each 
with rate n^^. An important cxaiiiple of such an estimation technique is that 
of quantum homodyne tomography in quantum optics [VR89]. This allows 
the estimation with arbitrary precision [DLP95, LPA95, LM+96j of the whole 
density matrix of a monochromatic beam of light by repeatedly measuring a suf- 
ficiently large number of identically prepared beams [SB+93, SB+96, ZVB04]. 
In [AGG05, BGA07] it is shown how to formulate the problem of estimating 
infinite dimensional states without the need for choosing a cut-off in the dimen- 
sion of the density matrix, and how to construct optimal minimax estimators 
of the Wigner function for a class of "smooth" states. 

If we have some prior knowledge about the preparation procedure, we may 
encode this by parameterizing the possible states as p = pe with 6 G Q some 
unknown parameter. The problem is then to estimate 6 optimally with respect 
to a distance function on 6. 

Indeed, one of the main problems in the finite dimensional case is to find 
optimal estimation procedures for a given family of states. It is known that if the 
state p is pure or belongs to a one parameter family, then separate measurements 
achieve the optimal rate of the class of joint measurements [Mat02]. However 
for multi-dimensional families of mixed states this is no longer the case and joint 
measurements perform strictly better than separate ones [GMOO]. 

In the Bayesian setup, one optimizes R-^{Mn, Pn) for some prior distribution 
77. We refer to [Jon94, MP95, LPT98, FKFOO, HR+02, BMM02, EN04, BM+05] 
for the pure state case, and to [CEM99, VL+99, MFFOO, KWOl, BB+04, ZS05], 
and [BB+06] for the mixed state case. The methods used here are based on 
group theory and can be applied only to invariant prior distributions and certain 
distance functions. In particular, the optimal covariant measurement in the case 
of completely unknown qubit states was found in [BB+06, IIM06] but it has the 
drawback that it does not give any clue as to how it can be implemented in a 
real experiment. 

In the pointwise approach [Hay02, HMOS, GMOO, BNGOO, FN95, Mat02, 
BG+03, HM06] one tries to minimize the risk for each unknown state p. As 
the optimal measurement-estimator pair cannot depend on the state itself, one 
optimizes the maximum risk -Rmax(-^n, P„), (see (6.3)), or a local version of 
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this which will be defined shortly. The advantage of the pointwise approach is 
that it can be applied to arbitrary families of states and a large class of loss 
functions provided that they are locally quadratic in the chosen parameters. The 
underlying philosophy is that as the number n of states is sufficiently large, the 
problem ceases to be global and becomes a local one as the error in estimating 
the state parameters is of the order n^^''^ . 

The Bayesian and pointwise approaches can be compared [Gil08], and in fact 
for large n the prior distribution tt of the Bayesian approach becomes increas- 
ingly irrelevant and the optimal Bayesian estimator becomes asymptotically 
optimal in the minimax sense and vice versa. 

6.2.1 Qubit State Estimation: the Localization Principle 

Let us now pass to the quantum statistical model which will be the object of 
our investigations. Let p € M2(C) be an arbitrary density matrix describing 
the state of a qubit. Given n identically prepared qubits with joint state p®", 
we would like to optimally estimate p based on the result of a properly chosen 
joint measurement M„. For simplicity of the exposition we assume that the 
outcome of the measurement is an estimator p,,, G Af2(C). In practice however, 
the result X may belong to a complicated measure space (in our case the space 
of continuous time paths) and the estimator is a function of the "raw" data 
Pn ■= Pn{X). The quality of the estimator at the state p is quantified by the 
risk 

Rp{Mn,p„) :=Ep{d{p,Pnf), 

where d is a distance between states. The above expectation is taken with 

respect to the distribution Pp(dx) := tr{pM{dx)) of the measurement results, 
where M{dx) represents the associated positive operator valued measure of the 
measurement M. In our exposition d will be the trace norm (see p. 80) 

IIpi -P2II1 :=tr(|pi -P2I), 

but similar results can be obtained using the fidelity distance. The aim is to find 
a sequence of measurements and estimators (M„,/5„) which is asymptotically 
optimal in the local minimax sense: for any given po 

limsup sup niip(M„, /3„) < lim sup sup nRp{Nn,pn), 

n^oo ||p-pol|i<n"'-/^+^ n->oo || ^ <„-i/2+e 

for any other sequence of measurement-estimator pairs {Nn, Pn)- The factor n is 
inserted because typically Rp{Mn, pn) is of the order 1/n and the optimization 
is about obtaining the smallest constant factor possible. The inequality says 
that one cannot find an estimator which performs better that pn over a ball 
of size centered at po, even if one has the knowledge that the state p 

belongs to that ball! 

Here, and elsewhere in the paper e will appear in different contexts, as a 
generic strictly positive number and will be chosen to be sufficiently small for 
each specific use. At places where such notation may be confusing we will use 
additional symbols to denote small constants. 
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As set forth in the Introduction, our measurement procedure consists of 

two steps. The first one is to perform separate measurements of a^, ay and 
Gz on a fraction n = n{n) of the systems. In this way we obtain a rough 
estimate p„ of the true state p which lies in a local neighbourhood around p 
with high probabihty. The second step uses the information obtained in the 
first step to perform a measurement which is optimal precisely for the states in 
this local neighbourhood. The second step ensures optimality and requires more 
sophisticated techniques inspired by the theory of local asymptotic normality 
for qubit states [GK06] . We begin by showing that the first step amounts to the 
fact that, without loss of generality, we may assume that the unknown state is in 
a local neighbourhood of a known state. This may serve also as an a posteriori 
justification of the definition of local minimax optimality. 

Lemma 6.1 Let Mj denote the measurement of the cxj spin component of a 
qubit with i = x,y, z. We perform each of the measurements Mi separately on 
h/3 identically prepared qubits and define 

p„ = ^(l + m), if |f|<l, 

where f = (f.,.. 7~y, f^) is the vector average of the measured components. If 
\f\ > 1 then we define p„ as the state which has the smallest trace distance to 
the right hand side expression. Then for all e G [0, 2], we have 

P (|1p„ - pII^ > 3n2^-i) < 6exp(-inn2^-i), Vp. 

Furthermore, for any < k < e/2, if n = n^~'^, the contribution to the risk 
— brought by the event E = [ ||p„ — p\\i > \/3n~^/^+^] satisfies 

E ( \\Pn - pWIxe ) < 24exp(-in2-'=) = 0(1). 



Proof. For each spin component tjj we obtain i.i.d coin tosses Xi with distribu- 
tion F{Xi = ±1) = (1 ± ri)/2 and average Tj. 

Hoeffding's inequality [VW96] then states that for all real numbers c > 0, 
we have P(|Xi — > c) < 2cxp(— ific). By using this inequality three times 
once for each component, we get 
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|2 > Sn^^-i < 6exp(-inn2^-i) Vp, 



which implies the statement for the norm distance since ||y5ri^p||i = X^i l^i^^'iP- 
The bound on conditional risk follows from the previous bound and the fact that 

IIP-/5„||? <4. □ 

In the second step of the measurement procedure we rotate the remaining n — h 
qubits such that after rotation the vector f is parallel to the 2;-axis. Afterwards, 
we coTiple the systems to the field and perform certain measurements in the 
field which will determine the final estimator p„. The details of this second step 
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are given in sections 6.4 and 6.5, but at this moment we can already prove that 

the effect of errors in the the first stage of the measurement is asymptotically 
negligible compared to the risk of the second estimator. Indeed by lemma 6.1 we 
get that if n = n^"**, then the probability that the first stage gives a 'Vrong" 
estimator (one which lies outside the local neighbourhood of the true state) 
is of the order exp(— ^n^'^^'') and so is the risk contribution. As the typical 
risk of estimation is of the order 1/n, we see that the first step is practically 
"always" placing the estimator in a neighbourhood of order 71^^/^+' of the true 
state p, as shown in figure 28. In the next section we will show that for such 
neighbourhoods, the state of the remaining n—n systems behaves asymptotically 
as a Gaussian state. This will allow us to devise an optimal measurement scheme 
for qubits based on the optimal measurement for Gaussian states. 

6.3 Local Asymptotic Normality 

The optimality of the second stage of the measurement relies on the concept of 
local asymptotic normality [Vaa98, GK06]. After a short introduction, we will 
prove that LAN holds for the qubit case, with radius of validity n^^/^+'' for all 
rj S [0, 1/4). We will also show that its rate of convergence is C'(n^^/''+''"'"') for 
arbitrarily small e. 

6.3.1 Introduction to LAN and some Definitions 

Let po be a fixed state, which by rotational symmetry can be chosen of the form 

^=('„' (6.4) 

for a given ^ < /i < 1. We parameterize the neighbouring states as Pu/ys where 
u = {ux,Uy,Uz) e M:^ such that the first two components account for unitary 
rotations around pQ, while the third one describes the change in eigenvalues 

Pv:=C/(v)('^7^ l-°-.J^W^ (6-5) 

with unitary U{^^) := exp{i{vxCrx + VyCjy)). The "local parameter" u should 
be thought of, as having a bounded range in or may even "grow slowly" as 

||u|| < n\ 

Then, for large n, the joint state := P®"^ of n identical qubits ap- 
proaches a Gaussian state of the form N'^ ® 0" with the parameter u appearing 
solely in the average of the two Gaussians. By TV" we denote a classical one- 
dimensional normal distribution centered at Uz which relays information about 
the eigenvalues of Pu/y^- The second term 0" is a Gaussian state of a harmonic 
oscillator which is a displaced thermal equilibrium state with displacement pro- 
portional to (Ux-, Uy). It contains information on the eigenvectors of Pu/y^- We 
thus have the convergence 
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to a much simpler family of classical - quantum states for which we know how 

to optimally estimate the parameter u. The asymptotic splitting into a classical 
estimation problem for eigenvalues and a quantum one for the eigenbasis has 
been also noticed in [BB+06] and in [HM06], the latter coming pretty close to 
our formulation of local asymptotic normality. 

The precise meaning of the convergence is given in theorem 6.2 below. In 
short, there exist quantum channels (completely positive maps) r„ which map 
the states P^J^ into iV" ® with vanishing error in trace norm distance, and 
uniformly over the local parameters u. From the statistical point of view the 
convergence implies that a statistical decision problem concerning the model 

can be mapped into a similar problem for tli(^ model ® such that 
the optimal solution for the latter can be translated into an asymptotically 
optimal solution for the former. In our case the problem of estimating the state 
p turns into that of estimating the local parameter u around tlic^ first stage 
estimator p„ playing the role of po- For the family of displaced Gaussian states 
it is well known that the optimal estimation of the displacement is achieved by 
the heterodyne detection [Hol82, YL73], while for the classical part it sufficient 
to take the observation as best estimator. Hence the second step will give an 
optimal estimator u of u and an optimal estimator of the initial qubit state 
Pn '■= Pu/v^- The precise result is formulated in theorem 6.4 

6.3.2 Convergence to the Gaussian Model 

We describe the state g) (f)^ in more detail. TV" is simply the classical 
Gaussian distribution 

:=N{u,,p{l-p)), (6.6) 

with mean Uz and variance p{l — p). 

The state ^" is a density matrix on 'H = J^(C), the representation space of 
the harmonic oscillator. In general, for any Hilbert space f), the Fock space over 
[) is defined (see sec. 2.3.1) as 

oo 

-^(f)) i-SdOa-'-^.f), (6.7) 
n=0 

with ®s denoting the symmetric tensor product. Thus is the simplest 

example of a Fock space. Let 

</.:=(l-p)5]/|fc)(fc| (6.8) 

fe=0 

be a thermal equilibrium state with denoting the fc-th energy level of the 
oscillator and p = < 1. For every a e C define the displaced thermal state 

(f){a) := D{a)(t)D{-a), 

where D{a) := exp(aa^ — aa) is the displacement operator (i.e. the Weyl op- 
erator W{a), cf. sec. 2.3.3), mapping the vacuum vector |0) to the coherent 
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vector 

|a)=exp(-H72)5^^|fc). 

fc=o ^ 

Here and a are the creation and annihilation operators on ^(C), satisfying 
\a,a}\ = 1. The family ^" of states in which we are interested is given by 

^" := 0(^2/i- Ittu), ueM^^ (6.9) 

with a-a := — Uj, + iux- Note that does not depend on Uz- 

We claim that the "statistical information" contained in the joint state of n 
qubits 

■■= P^/'U- (6-10) 

is asymptotically identical to that contained in the couple (A''", More 
precisely: 

Theorem 6.2 Let p" be the family of states (6.5) on the Hilbert space (C^)*^", 
let be the family (6.6) of Gaussian distributions, and let ^" be the family 
(6.9) of displaced thermal equilibrium states of a quantum oscillator. Then for 
each n there exist quantum channels (trace preserving CP maps) 

r„ : r((C2)»") ^ Li(M) ®T{F{C)), 
Sn : L\R) ® T{T{C)) ^ r((C")®") 

with Tin) the trace-class operators on H, such that, for any < 77 < 1/4 and 
any e > 0, 

sup ||iV"®<^"-T„(p;j) 111 = 0(71-1/4+"+^), (6.11) 

||u||<n'> 

sup IIp;^ - Sr^ (iV" r) 111 = 0(n-V4+'7+e). (6.12) 

Moreover, for each 62 > there exists a function f{n) of order C'(n^^/^+''+'^) 
such that the above convergence rates are bounded by f{n), with f independent 
of as long as \ \ — fjb\> 62. 

Remark Note that the equations (6.11) and (6.12) imply that the expressions 
on the left side converge to zero as n — )■ 00. Following the classical terminology 
of Le Cam [LC86| , we will call this type of result strong convergence of quantum 
statistical models (experiments). Another local asymptotic normality result 
has been derived in |GJ07| based on a different concept of convergence, which 
is an extension of the weak convergence of classical (commutative) statistical 
experiments. In the classical set-up it is known that strong convergence implies 
weak convergence for arbitrary statistical models, and the two are equivalent 
for statistical models consisting of a finite number of distributions. A similar 
relation is conjectured to hold in the quantum set-up, but for the moment this 
has been shown only under additional assumptions [GJ07]. 
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These two approaches to local asymptotic normality in quantum statistics 
are based on completely different methods and the results are complementary in 
the sense that the weak; convergence of [GJ07] holds for the larger class of finite 
dimensional states while the strong convergence has more direct consequences 
as it is shown in this paper for the case of qubits. Both results are part of a 
larger effort to develop a general theory of local asymptotic normality in quan- 
tum statistics. Several extensions are in order: from qubits to arbitrary finite 
dimensional systems (strong convergence), from finite dimensional to continu- 
ous variables systems, from identical system to correlated ones, and asymptotic 
normality in continuous time dynamical set-up. 

Finally, let us note that the development of a general theory of conver- 
gence of quantum statistical models will set a framework for dealing with other 
important statistical decision problems such as quantum cloning [Wer98] and 
quantum amplification [Cav82], which do not necessarily involve measurements. 

Remark The construction of the channels r„,S'„ in the case of fixed eigen- 
values {Uz = 0) is given in theorem 1.1 of [GK06]. It is also shown that a similar 
result holds uniformly over ||u|| < C for any fixed finite constant C. In [GJ07], 
it is shown that such maps also exist in the general case, with unknown eigen- 
values. A classical component then appears in the limit statistical experiment. 
In the above result we extend the domain of validity of these theorems from 
"local" parameters ||u|| < C to "slowly growing" local neighbourhoods ||u|| < n'' 
with T] < 1/4. Although this may be seen as merely a technical improvement, 
it is in fact essential in order to insure that the result of the first step of the 
estimation will, with high probability, fall inside a neighbourhood ||u|| < for 
which local asymptotic normality still holds (see figure 28). 

Proof. Following [GK06] we will first indicate how the channels T„ are con- 
structed. The technical details of the proof can be found in appendix A. 

The space (C^)'^" carries two unitary representations. The representation 
7r„ of SU(2) is given by 7r„(7i) = u®" for any u £ SU(2), and the representation 
TTn of the symmetric group S{n) is given by the permutation of factors 

Ti'niT) : Ul (g) • • • (8) I--)- Vr-i(l) • • • (g) Vr-i(n), T € S{n). 

As [7r„(u), 7r„(T)] = for all u S SU(2),t S S{n), we have the decomposition 

n/2 

{cT"= © n,<E^ni. (6.13) 

j=0,l/2 

The direct sum runs over all positive (half)-integers j up to n/2. For each fixed 
j, T-ij = C^-'+^ is an irreducible representation Uj of SU(2) with total angular 
momentum = j{ j + 1), and = C"^ is the irreducible representation of the 
symmetric group S{n) with Uj = („/9_,^) — ■ The density matrix p" 

is invariant under permutations and can be decomposed as a mixture of "block" 
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density matrices 

n/2 

Pn= PnAj)Pln®-- (6-14) 
j=0,l/2 

The probability distribution Pn,u{j) is given by [BB+06]: 

PuAj) ■■= (1 - Mu)^-^ Mr'+' (1 - P'i'^') , (6.15) 

with /Xu := /i + Uzl\/n^ := We can rewrite Pn,u(i) as 

Pn,uU) '■= -Sn,Mu("'/2+ j) X K{j,n,ii,u), (6.16) 

where 

i?n,.(fc) := f ?1 (1 - J^)""' , k = 0,...,n 



is a binomial distribution, and the factor K{j, n, /x, u) is given by 

Ku, n, u) := (1 - p^-^) "^+^^^:!7.^":^>)+!^(.y"-^^ , 

n+{j-3n- Vnu;, + l)//iu 

with jn ■■= n{fi - 1/2). Now K{j,n,fj,u) = 1 + 0(n"^/^+') on the relevant 
values of j, i.e. the ones in an interval of order n^/^+'^ around j„, as long as 
Hu is bounded away from 1/2, which is automatically so for big n. As Bn,fj.^{k) 
is the distribution of a sum of i.i.d. Bernoulli random variables, wc can now 
use standard local asymptotic normality results [Vaa98] to conclude that if j is 
distributed according to Pn,\i, then the centered and rescaled variable 

9n--= -/=- \/n(M- 1/2), 

converges in distribution to a normal , after an additional randomization 
has been performed. The latter is necessary in order to "smooth" the discrete 
distribution into a distribution which is continuous with respect to the Lebesgue 
measure, and will convergence to the Gaussian distribution in total variation 
norm. 

The measurement "which block", corresponding to the decomposition (6.14), 
provides us with a result j and a posterior state The function g„ = f]n{i) 
(with an additional randomization) is the classical part of the channel T„ . The 
randomization consists of "smoothening" with a Gaussian kernel of mean gn{j) 
and variance l/{2^/n), i.e. with t„j := (n^/*/-\/7r) exp (— y^(a; — (?„(j))^). 

Note that this measurement is not disturbing the state p" in the sense that 
the average state after the measurement is the same as before. 

The quantum part of T„ is the same as in [GK06] and consists of embedding 
each block state into the state space of the oscillator by means of an isometry 

: Hi ^ ^(C), ' 

Vj : |j,TO) 1-^ |j - to), 
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where {\j,m) : m = —j, . ■ . ,j} is the eigenbasis of the total spin component 
:= ai^\ cf. equation (5.1) of [GK06]. Then the action of the channel T„ 

is 

■' j 

The inverse channel Sn performs the inverse operation with respect to T„. 
First the oscillator state is "cut-off" to the dimension of an irreducible represen- 
tation and then a block obtained in this way is placed into the decomposition 
(6.13) (with an additional normalization from the remaining infinite dimensional 
block which is negligible for the states in which we are interested). 

The rest of the proof is given in appendix A. □ 



6.4 Time Evolution of the Interacting System 

In the previous section, we have investigated the asymptotic equivalence be- 
tween the states p" and iV" (g) 0" by means of the channel T„. We now seek to 
implement this in a physical situation. The A'^^-part will follow in section 6.5.2, 
the (/)"-part will be treated in this section. 

We couple the n qubits to a bosonic field; this is the physical implementation 
of LAN. Subsequently, we perform a measurement in the field which will provide 
the information about the state of the qubits; this is the utilization of LAN in 
order to solve the asymptotic state estimation problem. 

In this section we will limit ourselves to analyzing the joint evolution of the 
qubits and field. The measurement on the field is described in section 6.5. 

6.4.1 Quantum Stochastic Differential Equations 

In the weak coupling limit, [AFL90, GZ04], the joint evolution of the qubits 
and field can be described mathematically by quantum stochastic differential 
equations (QSDE). See section 2.3 for a short introduction, and [HP84, Par92] 
for a thorough account. The basic notions here are the Fock space, the creation 
and annihilation operators and the quantum stochastic differential equation of 
the unitary evolution. The Hilbert space of the field is the Fock space J"(L^(M)) 
as defined in (6.7). An important linearly complete set in J^(X^(M)) is that of 
the exponential vectors 

■■= ^1/)"' / e (6.17) 

with inner product (e(/) | e(g)) = cxp{{f\g)). The normalized exponential 
states I/) := e"^-'' ' •^^/^e(/) are called coherent states. The vacuum vector is 
:= e(0) and we will denote the corresponding density matrix by $. 

The quantum noises are described by the creation and annihilation martingale 
operators A] := a^{x{o,t]) and At := a(x[o,t]) respectively, where X[o,t] is the 
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indicator function for [0, t] and 

a{f):e{g)^{f\g)eig). 

The increments dA-^ := a{x[o,t+dt\) ~ <3^(X[o,t]) ^^'^ play the role of non- 
commuting integrators in quantum stochastic differential equations, comparable 
to the role of Brownian motion in classical stochastic calculus. 

We now consider the joint unitary evolution for qubits and field defined by 
the quantum stochastic differential equation [HP84, BGM04]: 

dUn{t) = {undAl - al^dAt - ia],o„dt)f/„(i), 

where J7„(i) is a unitary operator on (C^)®" (g) J'(L^(M)), and 

1 " 

a„ := -= ^ cr^*^ , 0-^'=^= 1 • • • [a^ + icTy)/2 • • • 1 , 

k=i 

with jn ■= (m ~ l/2)n. As we will see later, the "coupling factor" of the 

order n~^/^, is necessary in order to obtain convergence to the unitary evolution 
of the quantum harmonic oscillator and the field. 

We remind the reader that the n-qubit space can be decomposed into irre- 
ducible representations as in (6.13), and the interaction between the qubits and 
field respects this decomposition 

n/2 

Un{t)= C/j>(t)0l, 
j=0,l/2 

where 1 is the identity operator on the multiplicity space "H^, and 

Uj,n{t) ■■ Uj ^(^^(M)) ^ Uj J"(l2(]R)), 
is the restricted cocycle 

dUj^riit) = {ajdA\ - a]dAt - ]^a]ajdt)Uj^n{t), (6.18) 

with aj acting on the basis \j, m) of as 

= a/j - (i + TO + l)/2jn li, m + 1), 
a] li, to) = ^/j-m+ l-v/i + TO/2j„ I j, m - 1) . 

Remark We point out that the lowering operator for Lz acts as creator for 
our cut-off oscillator since the highest vector jj, j) corresponds by Vj to the 
vacuum of the oscillator. This choice does not have any physical meaning but 
is only related with our convention ^ > 1/2. Had we chosen fi < 1/2, then 
the raising operator on the qubits would correspond to creation operator on the 
oscillator. 
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By (6.14) the initial state p*^" decomposes in the same way as the unitary 
cocycle, and thus the whole evolution decouples into separate "blocks" for each 
value of j. We do not have explicit solutions to these equations but based on the 
conclusions drawn from LAN we expect that as n — )• oo, the solutions will be 
well approximated by similar ones for a coupling between an oscillator and the 
field, at least for the states in which we are interested. As a warm up exercise 
we will start with this simpler limit case where the states can be calculated 
explicitly. 

6.4.2 Solving the QSDE for the Oscillator 

Let and a be the creation and annihilation operators of a quantum oscillator 
acting on J^(C). We couple the oscillator with the bosonic field and the joint 
unitary evolution is described by the family of unitary operators U {t) satisfying 
the quantum stochastic differential equation 

dU{t) = {adAl - aUAt - ^a''adt)U{t). 

We choose the initial (unnormalized) state tjj{0) := e(z) (g) \fl), where z is any 
complex number, and we shall find the explicit form of the vector state of the 
system and field at time t: ^{t) := U{t)tp{0). 

We make the following ansatz: il){t) = e{at) (8 e(/t), where ft is given by 

/,(.s) := /(s)x[o,t](s) for some / G L'^{R). For each ^ G C, .g G define 
I{t) := {e{l3) (g) e{g) \ ip{t)). We then have I{t) = exp(^a(i) + {g \ ft)), so that it 
satisfies 

dm = {Pia{t) + mf{t)) I{t)dt . (6.19) 

We now calculate ^I{t) with the help of the QSDE. Since the annihilator 
acts as Ate{f) = (x[o,t] | />e(/), we have, for continuous g, dAte{g) = g{t)e{g)dt. 
However, since Ase{ft) is constant for s >t,we have dAte{ft) = 0. Thus 

dl{t) = {e{l3) (g) e{g) \ {adAl - aUAt - ^a'^ adt)ip{t)) , 

so that 

dl{t) = {g{t)a{t) - \Pa{t))I{t)dt . (6.20) 

Equating (6.19) with (6.20) for all t, all /3, and all continuous g, we find 
f{s) = a{s), ia{t) = -^a{t). Thus a{t) = a(0)e-5*, /^(s) = a(0)x[o,t](s)e-5« 
with a(0) = z. In conclusion, 'tp{t) = e(ze~2*)(g)e(ze~2*X[o,t](s))- For later use, 
we denote by ip^{t) := U{t)\z) (g) \fl) = e~l^l^/^f/(i)e(z) ig) the normalized 
solution of the QSDE. 

6.4.3 QSDE for Large Spin 

We consider now the unitary evolution for qubits and field: 

dUn{t) = {andAl - oldAt - ^a'landt)Unit). 
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It is no longer possible to obtain an explicit expression for the joint vector state 
ipn{t) at time t. However we will show that for the states in which we axe 
interested, a satisfactory explicit approximate solution exists. 

The trick works for an arbitrary family of unitary solutions of a quantum 
stochastic differential equation dU{t) = GdtU{t), and the general idea is the 
following. If tp{t) is the true state ip{t) = U{t)tp and ^(t) is a vector describing 
an approximate evolution {tp{0) = ^(0)) then with J7*^^j := U{t + dt)U{t)~^, 
we can write ■^(t + dt) — ^{t + dt) as 

^[t + dt) - ui^^,m + ui+atm - m + m - at + dt) . 

This equals 

u^dt im - m] + [u{t + dt) - uit)]uit)-'m[m - at + at)] , 

so that 

^(i + dt) - at + dt) = ui^dt m) - m] + Gam - dm • 

By taking norms we finally obtain 

d\\m-m\\<\\Gdtm-dmi (6.21) 

The idea is now to devise a family ^t) such that the right side is as small as 
possible. 

We apply this technique block- wise, that is to each unitary Uj^n{t) acting 
on Tij J^(Iy^(M)) (see equation (6.18)) for a "typical" j G Jn (see equation 
(6.39)). By means of the isometry Vj we can embed the space "Hj into the first 
2j + 1 levels of the oscillator and for simplicity we will keep the same notions 
as before for the operators acting on J^(C). As initial states for the qubits we 
choose the block states 

Theorem 6.3 Let = Uj,n{t) [p"„ O $] ?/],„(*) be the j-th block of the 

state of qubits and field at time t. Let (j)^{t) := U{t) (Xi U{t)^ be the joint 
state of the oscillator and field at time t. For any rj < 1/6, for any e > 0, 

sup sup supllpV (i)-(/."(t)||i =(!?(n-VW^,n-i/2+3''+^). (6.22) 

jeJr. ||u||<ni t 

Proof. From the proof of the local asymptotic normality theorem 6.2 we know 
that the initial states of the two unitary evolutions are asymptotically close to 

63jC1i O t llG r 

sup sup \\pl„-r\\i=0{n-'/^+^+^). (6.23) 

jeJn ||u||<n') 

The proof consists of two estimation steps. In the first one. we will devise 
another initial state which is an approximation of ^" and thus also of p"„: 

sup sup |lp^,„-.^"||i=0(e-"'). (6.24) 

jeJr. ||u||<n') 
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In the second estimate we show that the evolved states p"„(i) and (f)^{t) are 
asymptotically close to each other 

sup sup sup ||p"„(t) - = 0(n-l/^+''+^ n-^/^+s^+e^^ (g ^S) 

jeJn \H\<nV t 

This estimate is important, because the two trajectories are driven by different 
Hamiltonians, and in principle there is no reason why they should stay close to 
each other. From (6.23), (6.24) and (6.25), and using the triangle inequality, 
we get 

sup sup SUp||p"„(t)-<?;."(i)||l =C(n"^/''+''+%n-l/2+3r,+e)_ 
jeJn \\u\\<ni t 

The following diagram illustrates the above estimates. The upper line con- 
cerns the time evolution of the block state and the field. The lower line 
describes the time evolution of the oscillator and the field. The estimates show 
that the diagram is "asymptotically commutative" for large n. 

For the rest of the proof, we refer to appendix B. □ 

We have shown how the mathematical statement of LAN (the joint state of 
qubits converges to a Gaussian state of a quantum oscillator plus a classical 
Gaussian random variable) can in fact be physically implemented by coupling 
the spins to the environment and letting them "leak" into the field. In the next 
section, we will use this for the specific purpose of estimating u by performing 
a measurement in the field. 



6.5 The Second Stage Measurement 

We now describe the second stage of our measurement procedure. Recall that 
in the first stage a relatively small part n = n^~'^, 1 > k > 0, of the qubits is 
measured and a rough estimator /}„ is obtained. The purpose of this estimator 
is to localize the state within a small neighbourhood such that the machinery 
of local asymptotic normality of theorem 6.2 can be applied. 

In theorem 6.3 the local asymptotic normality was extended to the level of 
time evolution of the qubits interacting with a bosonic field. We have proven 
that at time t the joint state of the qubits and field is 

P^W ■■= P„.u(i)2^/rfze-l^-^""lV2.= exp(-|z|2)x 

j=0,l/2 

|e(ze-*/2),-) (e(ze-*/2), | ^ \e{ze-^/\[o,t] («))> {e{ze--/\[o,t] {u))\ 
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for ||u|| < n^. The index j serves to remind the reader that the first exponential 
states live in different copies J^{C)j of the oscillator space, corresponding to Hj 
via the isometry Vj. We will continue to identify T-Lj with its image in J^{C)j. 
We can now approximate the above state by its limit for large t, since 

exp(-|zn(e(ze-*/2),.|j,j)(e(ze-"/2x[o,t](«))|e(ze-"/2))=exp(-|zpe-*). 

(6.26) 

As we are always working with ||u|| < rf, the only relevant z are bounded by 
■fiV+^ for small 6. (The remainder of the Gaussian integral has an exponentially 
decreasing norm, as discussed before). Thus, for large enough time (i.e. for 
t > ln(n)), we can write p'^it) = p'^ioo) + e»(n''-i/^+%n3''-V2+«) ^ith 

n/2 

P"(oo) := Pn,u(j)|j,i)(i, j|® 

3=0,1/2 (6.27) 

/ dze-l^-v^«"lV2«^|e(ze-"/2))(e(ze-"/2)|exp(-|zn . 

Thus, the field is approximately in the state 0" depending on (ux, Uy), which 
is carried by the mode [u e^"/^X[o,oo) (m)) € L^(M) denoted for simplicity by 
g-u/2_ rpj^g atoms end up in a mixture of states with coefficients Pn,u(j), 
which depend only on u^, and are well approximated by the Gaussian random 
variable iV" as shown in theorem 6.2. Moreover since there is no correlation 
between atoms and field, the statistical problem decouples into one concerning 
the estimation of the displacement in a family of Gaussian states 0", and one 
for estimating the center of iV". 

For the former problem, the optimal estimation procedure is known to be 
the heterodyne measurement [Hol82, YL73]; for the latter, we perform a 'Vhich 
block" measurement. These measurements are described in the next two sub- 
sections. 

6.5.1 The Heterodyne Measurement 

A heterodyne measurement is a "joint measurement" of the two quadratures 
Q := (a + a})/y/2 and P := -i{ a — a^)/^/2 of a quantum harmonic oscillator 
which in our case represents a mode of light. Since the two operators do not 
commute, the price to pay is the addition of some "noise" which will allow for an 
approximate measurement of both operators. The light beam passes through a 
beamsplitter having a vacuum mode as the second input, and then one performs 
a homodyne (quadrature) measurement on each of the two emerging beams. If 
Qi, and are the vacuum quadratures then we measure the following output 
quadratures Qi := (Q + Q„)/\/2 and P2 := (P - Pt,)/\/2, with [Qi,P2] = 
0. Since the two input beams are independent, the distribution of v^Qi is 
the convolution between the distribution of Q and the distribution of Q„, and 
similarly for y/OF^- 
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In our case we are interested in the mode e "/^ which is in the state 
0", up to a factor of order C'(n''^^/^+^, From (6.9) we obtain 
that the distribution of Q is N{^/2{2ii- l)u^,l/{2{2ii - 1))), that of P is 
N{^/2{2fi — l)uy, l/(2(2/i — 1))), and the joint distribution of the rescaled out- 
put 

((Q + Q„)/v'2(2m - 1) , (P - P,)/v'2(2m-1)) , 

is „ „ 

N{u,, /x/(2(2m - 1)')) X N{uy, 11/ {2{2ti - if)). (6.28) 

We will denote by {ux, Uy) the result of the heterodyne measurement rescaled by 
the factor \/2^ — 1 such that with good approximation [iix^iUy) has the above 
distribution and is an unbiased estimators of the parameters {ux,Uy). 

Since we know in advance that the parameters {ux,Uy) must be within the 
radius of vaUdity of LAN we modify the estimators (ux,Uy) to account for this 
information and obtain the final estimator {Ux,Uy): 

Notice that if the true state p is in the radius of validity of LAN around p, 
then ||u|j < n'', so that \ui — Ui\ < \ui — Ui\. We shall use this when proving 
optimality of the estimator. 

6.5.2 Energy Measurement 

Having seen the (/)"-part, we now move to the A'^"-part of the equivalence be- 
tween p" and A^"(g)(/)". This too is a coupling to a bosonic field, albeit a different 
coupling. We also describe the measurement in the field which will provide the 
information on the qubit states. 

The final state of the previous measurement, restricted to the atoms alone 
(without the field), is obtained by a partial trace of equation (6.27) (for large 
time) over the field 

n/2 

T„" = ^ P„,u(j)b-, J)(J, Jl + 0(n''-V4+^n3'7-V2+e) . 

j=0,l/2 

We will take this as the initial state of the second measurement, which will 
determine j. 

A direct coupling to the does not appear to be physically available, but 

a coupling to the energy is realizable. This suffices, because the above state 
satisfies j = m (up to order 0{n^~^^'^^'^,n^^~^^'^~^'^)). We couple the atoms to 
a new field (in the vacuum state |0)) by means of the interaction 

dUt = {MdAl - dAt) - \Jldt}Ut , 

with Jz := ;^X]fe=i'^z- Since this QSDE is 'essentially commutative', i.e. 
driven by a single classical noise Bt = {A\ — At)/i, the solution is easily seen to 
be 

Ut = exp(J^ ® (4 - At)) . 
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Indeed, we have df{Bt) = f'{Bt)dBt + \f"{Bt)dt by the classical Ito rule, so 
that 

dcxp{iJz iE> Bt) = {iJzdBt - | J^dt} exp(i (g) Bt) . 
For an initial state \j,m) (g) jfi), this evolution gives rise to the final state 

Ut\j,m) (g) = \j,m) (g exp((m/v^)(Aj - At))fl 
= |j,m) (g) |(rn/Vn)x[o,t]), 

where |/) G J-"(L^(K)) denotes the normalized vector exp(— (/| /)/2)e(/). Ap- 
plying this to the states \j, j) {j, j \ in r" yields 

n/2 

UtT^i^^U} = Pn,uU)\jJ)ijJ\'»\j/V^Xlo,t]){j/Vnx[o,t]\ 

3=0,1/2 

+0(„'7-l/4+e^„3r,-l/2+e^_ 

The final state of the field results from a partial trace over the atoms; it is given 
by 

n/2 

E PnAj) l07V^)X[o,t])(07v^)X[o,t]l + 0(n''-i/4+%n3''-i/2+^) . (6.30) 

j=0,l/2 

We now perform a homodyne measurement on the field, which amounts to 
a direct measurement of {At + A|)/2t. In the state |(j7v^X[o,t])) this yields 
the value of j with certainty for large time (i.e. t ^ v^)- Indeed, for this 
state, E((At + A\)l2t) = j/y/n, whereas Var(At + A\)/2t) = Thus the 

probability distribution Pn,\i is reproduced up to order 0{rf~^/^^'^,n^^~^/'^^'^) 
in i-'^-distance. 

The following is a reminder from the proof of theorem 6.2. If we start with 
j distributed according to Pnij) and we smoothen — \/n{iJb — 1/2) with a 
Gaussian kernel, then we obtain a random variable g„ which is continuously 
distributed on M and converges in distribution to N{uz,n{l — A*))) the error 
term being of order 0{n^~^/^) + 0(n*~^/^). For j distributed according to the 
actual distribution, as measured by the homodyne detection experiment, we can 
therefore state that is distributed according to 

N{u„ n{l - m)) + e»(n''-l/^+^ ^ 0(„'7-i/2-) + 0(^^-1/2). (6.31) 

As in the case of (u.,., iiy), we take into account the range of validity of LAN 
by defining the final estimator 

\ if |5„| > Sn". ^ ' 

Similarly, we note that if the true state p is in the radius of validity of LAN 
around p, then ||u|| < n^', so that \uz — Uz\ <\uz ~ Uz\- 
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6.6 Asymptotic Optimality of the Estimator 

In order to estimate the qubit state, we have proposed a strategy consisting of 
the following steps. First, we use n := n^^'" copies of the state p to get a rough 
estimate p„. Then we couple the remaining qubits with a field, and perform 
a heterodyne measurement. Finally, we couple to a different field, followed by 
honiodync measurement. From the measurement outcomes, we construct an 
estimator p„ := Pa^/^- 

This strategy is asymptotically optimal in a global sense: for any true state 
p even if we knew beforehand that the true state p is in a small ball around 
a known state po, it would be impossible to devise an estimator that could do 
better asymptotically, than our estimator /3„ on a small ball around p. More 
precisely: 

Theorem 6.4 Let Pn be the estimator defined above. For any qubit state po 
different from the totally mixed state, for any sequence of estimators the 
following local asymptotic minimax result holds for any < e < 1/12.- 

limsup sup ni?(p, /3„) < limsup sup nR{p,Qn)- (6.33) 

n-)-oo ||p-po||i<«-^''^+" ||p-po||i<n-^/^+' 

Let {fio, 1 — Po) be the eigenvalues of po with no > 1/2. Then the local asymp- 
totic minimax risk is 

limsup sup nR{p,Pn) = -Rininimax(/"o) = S/Uq - 4/io. (6.34) 

n^-oo ||p-pol|i<n-^/^+' 

Proof. We write the risk as the sum of two terms corresponding to the events E 
and E"^ that p„ is inside or outside the ball of radius n~^^^~^'^ around p. Recall 
that LAN is valid inside the ball. Thus 

R{p, M = E{\\p - pX Xec) + mp - PnWl Xe), 

where the expectation comes from /5„ being random. The distribution of the 
result pn of our measurement procedure applied to the true unknown state p 
depends on p. We bound the first part by Ri and the second part by R2 as 
shown below. 

Ri equals V{E'^) times the maximum error, which is 4 since for any pair of 
density matrices p and a, we have \\p — a\\l < 4. Thus 

i?i=4P(||p-p„||i>n-V2+^). 

According to lemma 6.1 this probability goes to zero exponentially fast, therefore 
the contribution brought by this term can be neglected. 

We can now assume that p„ is in the range of validity of local asymptotic 
normality and we can write p**" = with u the local parameter around p„. 
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We get the following inequalities for the second term in the risk. 



n\\p-Pn\\lxE) < E 
< 



P\\l 



sup E 

p-po\\<n--^/^+- 



Pn - P\\l 



EpU(oo) 



Pn = PO 

\\Pn-p\\l 



Pn = PO 



< sup 

||^,-po||<n-l/2 + e 

+ sup \\p^{t) - p'^{oo)\\i sup \\pn- p\\l 

||p-po||<n-i/^+« «n 

\pn-p\\l 



Epu(oo) 



Pn = PO 



< sup 

||p-poll<"-l/2 + . 

+0^-1+2" sup mt)-Pni00)h 

\\p-po\\<n-y^+^ 

= R2. (6.35) 



The first two inequalities are trivial. In the third inequality we change the 

expectation from the one with respect to the probability distribution of our data 
Ppu((') to the probability distribution F^m^^^y In doing so, an additional term 
||Ppu(t) — Ppu(oo)||i appears which is bounded from above by ||/o"(t) — p"(oo)||i. 
In the last inequality we can bound ||/5„ — p\\1 by cn~^~^'^'^ for some constant c. 
Indeed from definitions (6.29) and (6.32) we know that — po||i < c'n~^^^^^ 
and additionally we are under the assumption Hp — poHi < n"^/^"*"^ with e < 77. 

For the following, recall that all our LAN estimates arc valid uniformly 
around any state p° = p as long as /x — 1/2 > €2 > 0. As we are working with p 
different from the totally mixed state and ||p — p\\ < n~^/^+^, we know that for 
big enough n, /i — 1/2 > £2 for any possible p. We can then apply the uniform 
results of the previous sections. 

The second term in R2 is 0{n-^/'^+^''+^ ,n-^/'^+^i+^) where ^ > can be 
chosen arbitrarily small. Indeed, in the end of section 6.4, wc have proven that 
after time t > Inn, we have {{p'^it) - /9"(oo)||i = 0{n-^/'^+'^+\n-'^/^+^'i+^). 
The contribution to nR{p, Pn) brought by this term will not count in the limit, 
as long as 77 and e are chose such that 1/12 > r] > e. 

We now deal with the first term in R2. We write p in local parameterization 
around po = p as p^^/^- We have 

\\Pn-p\\l = WPu/^- Pa^/vrMi 

^ {uz - u^f + (2/x - 1)^((mx - u^f + {uy - iiyf) 
n 

+ 0(||u-u„||^n-3/'). (6.36) 

The remainder term Odju— u„||'^n^'^/^) is negligible. It is 0{rt''^^^/'^) which 
does not contribute to nR{p,pn) for 77 < 1/6. This is because on the one hand 
we have asked for ||p„ — pj| < n^^/^+'^, and on the other hand, we have bounded 
our estimator u„ by using (6.29) and (6.32). 
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We now evaluate Epu(oo) [rf(u, u„)^], where (i(u, u„)^ is defined as 

d(u, v)2 := 4 [{u, - v,f + {2n - - v^f + {uy - Vyf)] . (6.37) 

Note that the risk of u„ is smaller than that of u„ (see discussion below (6.29) 
and (6.32)). Under the law Pp^(oo) the estimator u„ has a Gaussian distribu- 
tion as shown in (6.28) and (6.31) with fixed and known variance and unknown 
expectation. In statistics this type of model is known as a Gaussian shift exper- 
iment [Vaa98]. Using (6.28) and (6.31), we get Epu(oo) [{uz — Uz)^] < m(1 ^ m) 
andEpu(co) [(^i — Wi)^] < /x/(2(2/i— 1)^) for i = a;, y. Substituting these bounds 
in (6.36), we obtain (6.34). 

We will now show that the sequence Pn is optimal in the local minimax 
sense: for any po and any other sequence of estimators Qn we have 

Ro = limsup sup nR{p, Qn) > 8fj,o — 4/Uo- 

n->-co \\p-po\\i<n-'^/^+^ 

We will first prove that the right hand side is the minimax risk i?minimax(A*o) 
for the family of states iV" ig) (/>" which is the limit of the local families of 
qubit states centered around po- We then extend the result to our sequence of 
quantum statistical models /?". 

The minimax optimality for TV" (g) (/>" can be checked separately for the 
classical and the quantum part of the experiment. For the quantum part 0", 
the optimal measurement is known to be the heterodyne measurement. A proof 
of this fact can be found in lemma 7.4 of [GK06]. For the classical part, which 
corresponds to the measurement of L^, the optimal estimator is simply the 
random variable X ^ A^" itself [Vaa98] . 

We now end the proof by using the other direction of LAN. Suppose that 
there exists a better sequence of estimators Qn such that 

^0 < ^minimax (Mo) = 8/io " 

We will show that this leads to an estimator fi of u for the family A" (g) 0" 
whose maximum risk is smaller than the minimax risk i?minimax(/^o)i which is 
impossible. 

By means of a beamsplitter one can divide the state cp^ into two independent 
Gaussian modes, using a thermal state cp := 0" as the second input. If r and t are 
the reflectivity and respective transmittivity of the beamsplitter (r^ + 1'^ = 1), 
then the transmitted beam has state = 0*" and the reflected one 0"^ = 
0''". By performing a heterodyne measurement on the latter, and observing the 
classical part A^", we can localize u within a big ball around the result u with 
high probability, in the spirit of lemma 6.1. More precisely, for any small e > 
we can find a > big enough such that the risk contribution from unlikely u's 
is small 

E(||u-afx||u-u||>a)<?- 

Summarizing the localization step, we may assume that the parameter u satisfies 
||u|| < a with an e loss of risk, where a = a{r, e). 
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Now let n be large enough such that if > a, then the parameter u falls 
within the domain of convergence of the inverse map Sn of theorem 6.2 and by 
(6.12) (with e replacing rj and 6 replacing e) we have 

IIp^" - S(iV*" ® 0*")||i < C7n-i/^+^+^ 

for some constant C. 

Next we perform the measurement leading to the estimator g,!, and cquiv- 
alently to an estimator u„ of u. Without loss of risk we can implement the 
condition ||u|| < a into the estimator u„ in a similar fashion as in (6.29) and 
(6.32). The risk of this estimation procedure for is then bounded from 
above by the sum of three terms: the risk nRp{gn)/t^ coming from the qubit 
estimation, the error contribution from the map Sn which is a'^n~^/^~^'^~^^ , and 
the localization risk contribution e. This risk bound uses the same technique 
as the third inequality of (6.35). The second contribution can be made ar- 
bitrarily small by choosing n large enough, for e < 1/4. From our assump- 
tion we have Rq < i?minimax(/^o) ^-^id we can choose t close to one such that 
Ro/t"^ < i?minimax(A*o) and further choose e such that Ro/t'^+e < i?minimax(Ato)- 

In conclusion, we get that the risk for estimating u is asymptotically smaller 
that the risk of the heterodyne measurement combined with observing the clas- 
sical part which is known to be minimax [GK06]. Hence no such sequence gn 
exists, and p„ is optimal. □ 

Remark In theorem 6.33, we have used the risk function R{p, p) = E(d^ {p, p) ) , 
with d the Li-distance d{p, p) = \\p — p\\i. However, the obtained results can 
easily be adapted to any distance measure dP{pii,pu) which is locally quadratic 
in u — u, i.e. 

d^{pix,Pxx)= X! lc.l3{Ua-K){Ui3-Uij) + 0{\\u-uf). 

For instance, one may choose d'^{p,p) = 1 — F'^(p,p) with the fidelity 
F{p,p) := tr(-\/v^^PV^)- For non-pure states, this is easily seen to be locally 
quadratic with 

(2/xo - 1)2 

7=1 (2mo - 1)2 





l-{2no-ir 

For the corresponding risk function Rp{p,pn) := E(l — F^{p,pn)), this yields 
limsup sup ni?ir(p, Pn) = Mo + 1/4, (6.38) 

n->-oo ||p-pol|i<n-^''^+' 

with the same asymptotically optimal p. The asymptotic rate Rp ~ '^'^4n^ 
found earlier in [BB+06], using different methods. 
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6.7 Conclusions 

In this chapter, we have shown two properties of quantum local asymptotic 
normality (LAN) for qubits. First of all, we have seen that its radius of validity 
is arbitrarily close to n"^/** rather than And secondly, we have seen how 

LAN can be implemented physically, in a quantum optical setup. 

We use these properties to construct an asymptotically optimal estimator 
Pn of the qubit state p, provided that we are given n identical copies of p. 
Compared with other optimal estimation methods [BB+06, HM06], our mea- 
surement technique makes a significant step in the direction of an experimental 
implementation. 

The construction and optimality of p„ are shown in three steps. 

I In the preliminary stage, we perform measurements of Ux, <Jy and Oz on 
a fraction n = n^~'^ of the n atoms. As shown in section 6.2, this yields a 
rough estimate /?„ which lies within a distance n"^/^"*"^ of the true state 
p with high probability. 

II In section 6.3, it is shown that local asymptotic normality holds within 
a ball of radius around p {rj > e). This means that locally, in 

the limit n — >^ cxd, all statistical problems concerning the n identically pre- 
pared qubits are equivalent to statistical problems concerning a Gaussian 
distribution and its quantum analogue, a displaced thermal state </>" 
of the harmonic oscillator. 

Together, I and II imply that the principle of LAN has been extended to 
a global setting. It can now be used for a wide range of asymptotic statis- 
tical problems, including the global problem of state estimation. Note that 
this hinges on the rather subtle extension of the range of validity of LAN to 
neighbourhoods of radius larger than n~^/^. 

Ill LAN provides an abstract equivalence between the n-qubit states P®J^ 
on the one hand, and on the other hand the Gaussian states A^" (g) (/)". 
In sections 6.4 and 6.5 it is shown that this abstract equivalence can be 
implemented physically by two consecutive couplings to the electromag- 
netic field. For the particular problem of state estimation, homodyne and 
heterodyne detection on the electromagnetic field then yield the data from 
which the optimal estimator /3„ is computed. 

Finally, in section 6.6, it is shown that the estimator /5„, constructed above, 
is optimal in a local minimax sense. Local here means that optimality holds 
in a ball of radius slightly bigger than around any state po except the 

tracial state. That is, even if we had known beforehand that the true state lies 
within this ball around po, we would not have been able to construct a better 
estimator than /)„, which is of course independent of po- 

For this asymptotically optimal estimator, we have shown that the risk R 
converges to zero at rate R{p, Pn) ~ 8wl_Vo^ with /xq > | an eigenvalue of p. 
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More precisely, we have 

limsup sup nR{p,pn) = 8/io — ^fJ-o- 

n->-co ||p-pol|i<n-^''^+' 

The risk is defined as R{p, p) ~ E((i^(p, p)), where we have chosen d{p, p) to be 
the Li-distance ||/5 — p||i := tr{\p — p\). This seems to be a rather natural choice 
because of its direct physical significance as the worst case difference between 
the probabilities induced by p and p on a single event. 

Even still, we emphasize that the same procedure can be applied to a wide 
range of other risk functions. Due to the local nature of the estimator /5„ for 
large n, its rate of convergence in a risk R is only sensitive to the lowest order 
Taylor expansion of R in local parameters u — u. The procedure can therefore 
easily be adapted to other risk functions, provided that the distance measure 
(P{pii,Pu) is locally quadratic in u — u. 

Remark The totally mixed state (/i = 1/2) is a singular point in the pa- 
rameter space, and theorem 6.2 does not apply in this case. The effect of the 
singularity is that the family of states (6.9) collapses to a single degenerate state 
of infinite temperature. However this phenomenon is only due to our partic- 
ular parameterization, which was chosen for its convenience in describing the 
local neighbourhoods around arbitrary states, with the exception of the totally 
mixed state. Had we chosen a different parameterization, e.g. in terms of the 
Bloch vector, we would have found that local asymptotic normality holds for 
the totally mixed state as well, but the limit experiment is different: it consists 
of a three dimensional classical Gaussian shift, each independent component 
corresponding to the local change in the Bloch vector along the three possible 
directions. Mathematically, the optimal measurement strategy in this case is 
just to observe the classical variables. However this strategy cannot be im- 
plemented by coupling with the field since this coupling becomes singular (see 
equation (6.18)). These issues become more important for higher dimensional 
systems where the eigenvalues may exhibit more complicated multiplicities, and 
will be dealt with in that context. 



A Appendix: Proof of Theorem 6.2 

Here we give the technical details of the proof of local asymptotic normality 
with "slowly growing" local neighbourhoods ||u|| < n^, with t] < 1/4. We start 
with the map T„. 



A.l The Map r„ 

Let us define, for < e < (1/4 — 77) the interval 

Jn = {j ; (m - l/2)n - ni/2+^ < j < [p - l/2)n + n^/^+e | (5.39) 

Notice that j G Jn satisfies 2] > for all p, — 1/2 > 62 smd n big enoTigh, 
independently of p. Then J7„ contains the relevant values of j, uniformly for 
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M - 1/2 > £2: 



lim PnAJn) = 1 - 0(n-V2+^). 



(6.40) 



This is a consequence of HoefFding's inequality applied to the binomial distri- 
bution, and recalling that p„,u(i) = B{n/2+j){l + 0{n~^/^+'')) for j e J„. 
We upper-bound ||r„(p;j) - iV" (g) (^"|| by the sum 



3E 



+ sup \\V,plV^ 



jeJn 



(6.41) 



The first two terms are "classical" and converge to zero uniformly over ||u|| < n''. 

For the first term, this is (6.40), while the second term converges uniformly on 
/X — 1/2 > €2 at rate n^~^^^ [KG09]. The third term can be analyzed as in 
proposition 5.1 of [GK06]: 



< 



V]rv, +W-PjrPA\„ (6.42) 



where Pj := VjV^ is the projection onto the image of Vj. We will show that 
both terms on the right side go to zero uniformly at rate over j € Jn 

and ||u|| < "riP . The trick is to note that displaced thermal equilibrium states 
are Gaussian mixtures of coherent states 



^/2 



-|z-V2M-laur/2s^ 



|z)(z|)d2z, 



(6.43) 



where := (1 - A')/(4y^ - 2). 

The second term on the right side of (6.42) is bounded from above by 



= J e-l^-«""l^/2^^|||z)(z|-P,|z)(z|P,||id2z, 
which after some simple computations can be reduced (up to a constant) to 

e-\-\'/^^'\\pJ-\z + ^/2j:^a^)\\d^z. (6.44) 



We now split the integral, the first part is integrating over |z| > n''+* with 
< S < 1/4 — 77/2. The integral is dominated by the Gaussian and its value 



is 0{e-" 



2(r, + 5) 



). The other part is bounded by the supremum over |z| < 



271"+* (as ||u|| < n") of |lP/|z)||. Now l|P/|z)l| < |z|VVj! = 0(e-"(V2-r)-25)-) 
uniformly on j € J^n, for any /i — 1/2 > £2 since then 2j > e^n. 
The same type of estimates apply to the first term: 



Pn,j 



+ 



Ad 



Ad 









vj<p°vj)-vjrv, 



< 



(6.45) 
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The first term on the right side does not depend on u. From the proof of lemma 
5.4 of [GK06], we know that 



< 



P' 

1 - p2j+ 



1+P 



2i+i 



with p = (1 — //)///. Now the left side is of the order p^^'^^ which converges 
exponentially fast to zero uniformly on /x — 1/2 > £2 and j € Jn- 

The second term of (6.45) can be bounded again by a Gaussian integral 

_L_ y e-l^lV2«^||A(u,z,i)|Ud2z, (6.46) 

where the operator A(u,z, j) is given by 

A(u,z,i) := Ad[c/,(^)] (l^t|z)(z|y,.) 

-V] |z + y2/i- lau)(z + y2/i- laul V^. 

Again, we split the integral along ||z|| > ■riP^^ . The outer part converges to 

zero faster than any power of n, as we have already seen. The inner integral, 
on the other hand, can be bounded uniformly over ||u|| < -nP , \i— 1/2 > ti and 
3 S Jn by the supremum of ||A(u, z, j)||i over |z| < 2n''+^, /x— 1/2 > £2, j e Jn 
and ||u|| < -rf. 

Let the vector z e be such that cti = iij yJ2\i — 1, and introduce the 
notation 'ip{n,j,v) = VjUj{v/y/n)\j,j). Then, up to a \/2 factor, ||A(u, z, j)||i 
is bounded from above by 



+ 



ip{n,j,z) - |z) 
V'(n, j, u + z) - |z + \/2/Lt - loiu) 

(^) (^) - ( w) l^'^) 



+ 



(6.47) 



This is obtained by adding and subtracting the states \'ip{n, j, z)) (■)/'("-, J, z) | and 
|^/^(n, j, u + z))(f/;(n, j, u + z)| and using the fact that — l</')('/*l l|i = 

\/2||'!/' — ^11 for normalized vectors V, 

The first two terms are similar, we want to dominate them uniformly. We 
replace u + z by z with |z| < 2n''+'', and then write: 



||V(n,i,z)-|z)||^ = ^Kfc|V(n,i,z))-(fc|z)|^ 



fe=0 
r-l 



< 



Y,\{m{n,3,i))-{kW 



fe=0 



+2X:(IW(n,j,z))p + Kfc|z)|2) . (6.48) 



k=r 
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If z = |z|e'^ then we have [HM06] 

(fc|V(n,i,z)) = y^(sin(|z|/V^)e^^)'(cos(|z|V^))'^-\ 



(A;|z) = exp ( — 



(2m-1)N2\ (e^«|z|y2^)' 



In (6.48), we choose r = n^''+^^ with €3 such that it satisfies the conditions 
2(5 + 2r/ + e < 2?7 + £3 + e < 1/2 and rj + e^ < 1/4. The tail sums are then of the 
order 



k=r 



C(exp(-n2''+^3)) 



fc 



fe=r fc=r ^ / w / 

= 0(exp(-n2''+'==^)). 



For the finite sums wc use the following estimates which are uniform over all 

|z| < 2^"+^ k<r,je Jw- 



(sin(|z|/V^))'= = (|z|/V^)'=(l + 0(n4"+^3+25-i))^ 

(cOs(|z|/\/^))2^-'= = exp ^^^^2„-l/2+e+25^)^ 

where we have used on the last line that (1 + x/n)^ = exp(x)(l + 0{n~^/'^x)) 
for X < n^/^"^" (cf. [KG09]). This is enough to show that the finite sum 

converges uniformly to zero at rate (at least if £3 is small 

enough). The first two terms in (6.47) decrease as the square root of this, that 
is C»(n''-i/4+^/2+«3/2)_ 

Notice that the error terms depend on // only through j, and that 2j > en 
for — 1/2 > e2. Hence they are uniform in 11. 

We now pass to the third term of (6.47). By direct computation, it can 
be shown that if we consider two general elements exp(iXi) and exp(iX2) of 
SU(2), with Xi self-adjoint elements of M{C^), then 

g-i(Xi+x.)gi^igiX.g[Xi,^.l/2 ^ ^ 0{Xi,Xi,Xi,), (6.49) 

where the O(-) contains only third order terms in Xi,X2. If Xi,X2 are in 
the linear span of and ay then all third order monomials are such linear 

combinations as well. 
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In particular we get that for z, u < 

■■= u[-^)u[^)u[^)cxpiiiu,vy-uyv,)a./n) 

1 + 0(n-2+4';+4^3) 0(„-3/2+3^+3e3) 
0(^-3/2+3^+363 ) 1 _^ 0(^-2+4^+4^3 ; 



(6.50) 



Finally, using the fact that is an eigenvector of L^, the third term in 
(6.47) can be written as 

\\\3,jHj,j\-uM\jJ){jJ\uMH\ 

and both states arc pure, so it suffices to show that the scalar product converges 
to one uniformly. Using (6.50) and the expression of {j\Uj{(3)\j) [HM06] we get, 
as j < n, 

(jJlUjiPUj) = [UiPKiY = 1 + o{n-'+'^+''% 

which implies that the third term in (6.47) is of order C'(n~-'^+^''+^'^-^). By 
choosing €3 and e small enough, we obtain that all terms used in bounding 
(6.46) are uniformly 0{n-^/'^+'^+'') for any e > 0. 

This ends the proof of convergence (6.11) from the n qubit state to the 
oscillator. 

A.2 The Map Sn 

The opposite direction (6.12) does not require much additional estimation, so 

we will only give an outline of the argument. 

Given the state (E) (j)^, we would like to map it into p" by means of a 
completely positive map Sn, or at least come as close as possible. 

Let X be the classical random variable with probability distribution A/'". 
With X we generate a random j e Z as follows 

j{X) = [V^X + n{fx-l/2)]. 

This choice is evident from the scaling properties of the probability distribution 
p" which we want to reconstruct. Let be the probability distribution of j(X). 
By classical local asymptotic normality results we have the convergence 

sup \\q^-p^\\^^Oin"-'/'). (6.51) 

l|u||<"'' 

Now, if the integer j is in the interval jT'n, then we prepare the n qubits 
in block diagonal state with the only non-zero block corresponding to the j 
irreducible representation of SU(2): 

<J--= (vS^«+tr(P,^<A")l)®^. 

The transformation 0" 1— )■ r " ■ is trace preserving and completely positive [GK06]. 
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If j ^ Jn then we may prepare the qubits in an arbitrary state which we 
also denote by r"^. The total channel Sn then acts as follows 

n/2 

S„ : iV" ® ^ r„" := 

3=0,1/2 

We estimate the error ^ '''"Hi as 

Hp" - < Ikn -Kill + 2Ppu(j i j„) + sup IK,. - pijiu 

jeJn 

The first term on the r.h.s. is 0(n''~^/^) (see (6.51)), the second term is 
0(n^~^/^) (see (6.40)). As for the third term, we use the triangle inequality 
to write, for j G Jn, 



T • — 



The first term is 0{e "(^/^ '7 25)^^ according to the discussion following equa- 
tion (6.44). The second term on the right is £)(n~^/^+''+^) according to equa- 
tions (6.45) through (6.50). 

Summarizing, we have ||5„(iV" O 0") - p^||i = C>(n-V4+'7+«)^ ^hich estab- 
lishes the proof in the inverse direction. □ 



B Appendix: Proof of Theorem 6.3 

We split the proof of theorem 6.3 in two steps. 
B.l First Estimate 

We build up the state p^^j by taking linear combinations of number states |m) to 
obtain an approximate coherent state |z), and finally mixing such states with a 
Gaussian distribution to get an approximate displaced thermal state. Consider 
the approximate coherent vector Pm|z), for some fixed z g C and m = rf, with 
7 to be fixed later. Define the normalized vector 



|«'=P«£^|'"'- ''''' 

We mix the above states to obtain 

Pl„ := J e-l-v^-lV2.^ (ICj)(C,,l) rf^z. 

Recall that = (1 - /u)(4/i - 2), and 
v27rs2 J 
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From the definition of |V'z,j) we have 

|||C,,)-|z)||<x/2l 



I m 



'ml 



A 2, 



(6.53) 



which implies that 
Indeed, we bound it by 



V2 



1 = 0{e ) for any e > 0, for any 7 > 2{'q+e). 

z + ^J2iJL - lau| 



^1 



'TO! 



A\/2 d^z 



and split the integral into two parts. The integral over the domain |z| > rf'^^ 



is dominated by the Gaussian factor and is 0{e " 



2(»)+«)x 



The integral over 



the disk |z| < n''+'^ is bounded by the supremum of (6.53) since the Gaussian 
integrates to one, and is 0(e~*^'*'/^~''~'^)"^). In the last step we use Stirling's 

formula to obtain log (n''+'^)"^/-\/nTl « (77 + e — 7/2)71'*' logn. Note that the 

estimate is uniform with respect to n — 1/2 > €2 for any fixed €2 > 0. 



B.2 Second Estimate 

We now compare the evolved qubits state p^j^it) and the evolved oscillator state 
(j)^{t). Let |V'mj(*)) ~ ^j,n{t) |m) (8) be the joint state at time t when the 
initial state of the system is \m) corresponding to \j,j — m) in the basis 
notation. We choose the following approximation of |V'm,j(*))' 

m 

ICjW) :=^c„(TO,z)ai(t)|m-i)®|e-i/2«;^[0,t](«))i, (6.54) 

i=0 

where ai{t) = exp{{—m + i)t/2), and we define the numbers Cn{m, i) inductively 
by Cn{m, 0) := 1 and 

Cn{m,t) := Cn{m,i- 1)^^-1^,/^^. 

The n-particle vector |/)„ := Z*^" was defined in (6.17). Note that in particular, 
for /u — 1/2 > €2 and j G Jn, we have 

C„(TO,i) < ^(7)(l+|n-V2+e)i. 

We now apply the estimate (6.21). By direct computations, we get 

^ m 

d\Cj{t)) = -^^Cn{m,i)ai{t){m - i)\m - i) \e-^/^''xio,t]{u))idt 

(6-55) 

+ '^Cn{m,i)ai-i{t)\m - i) O |e"^/2"x[o,t](w))i-i |X[t,t+dt]), 

i=l 
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where 

i+l 

5 := ^ / ® / ® • • • 5 • • • ® /• 

k=l 

From the quantum stochastic differential equation, we obtain 



|m - i - 1) ® |e-i/2"x[o,t](w))i ®s |X[t,t+dt]) 
i 5^c„(m,i)a,(t)(m - i y^-^+J+' \m - i) ® |e-V2"x[o,t] W)^^* • 



2 

i=0 



In the first term of the right hand side of the above equation, we can replace 
c„(m, i) *'™~'2j-^'(~^)^^"^ by c„(to, z + 1) and thus we obtain the same sum as 
in the second term of the left side of (6.55). Thus 



GdtlCj(*))-^^ICjW) = 

i J2 Cn{m,i)ai{t){m - j) ^On-i)+m-»-i |^ _ ^ \e-'/^''x[o,t]{u))i dt. 



m—l 

2 



i=0 



Then, using c„(m,i) < ■^J + (2/e2)n-i/2+e)i^ ^e find that the expression 
ll<^dtCj(*) - c^Cj(*)ll is bounded from above by 

"'-1 2nl/2 



i=0 

We have 

Inside the sum, we recognize the binomial terms with the m*'^ term missing. 
Thus the sum is 

(l + n-V2+^ - e-*n-V2+^)" - ((1 - e-*)(l + 

< (1 + n-V2+^)™(l - (1 - e-*)™) < (1 + n-i/2+^)™ me"*. 

Then there exists a constant C (independent of if /x — 1/2 > £2) such that 

l|G.tC„- W - dC.,m < |e-*/^m^/2(n-V2+^ + ran-') (l + ^^"^2+ j 
By integrating over t, we finally obtain 

IIC,W-C,,(i)ll<Cm3/'(ri-'/'+^ + mn-^)(^l + -n-i/2+^j . (6.56) 
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Note that under the assumption 7 < 1/3 — 2e/3, the right side converges to zero 
at rate ji37/2-i/2+£ £qj. all m < to = riP'. Summarizing, the assumptions which 
we have made so far on 7 are 

27? + 2e<7< l/3-2e/3. 

Now consider the vector IV'zj) as defined in (6.52), and introduce the no- 
tation \ip^j{t)) = Uj^„{t)\ip^j) b Then based on (6.54), we choose the 
approximate solution 

m m 

Kjit)) = e-'^''/' E tS E c^n{m,i)a,{t)\m - i) ^ \e-y'^X[o,t]{u))i. 

m=0 i=0 

Note that the vectors |V'fej(i)) aiid \£,kj{t)) hve in the "/c-particle" subspace of 
Hj(g)J^{L'^{M.)) and thus are orthogonal to all vectors \i)pj{t)) and |Cpj(i)) with 
p^k. By (6.56), the error is 



1/2 



m! 



< 



IK,-(i)-ez"i(i)||< 

(m 
m=0 

CTO3/2(n-l/2+^ + ^„-l) (1 + A„-l/2+.^"/' ^ 

(6.57) 

We now compare the approximate solution j(t) with the "limit" solution ip'z,{i) 
for the oscillator coupled with the field as described in section 6.4.2. We can 
write 

00 m 

Mt) = e-l^l^/^ E 55 E ^e-(-^)*/2|,n - i) ^ |e-V2«X[o,t](«))i. 



TO=0 i=0 



Then 



IICjW-V'z(t)||' = e-l^l^ J2 



m! 



Now 



+ e-H^E^E^- 



{m—i)t 



m=0 i=Q 



t\i 



(1-e-*) 



Cn{m,i) 



< |c„(TO,i)2-(7) 



< (T) 



n(> 



1-11(^1 + 



20'-jn)-"t+P 
2jn > 



< C2('")TOn-i/2+% 
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where C2 does not depend on ^ as long as — 1/2 > e2 (recall that the depen- 
dence on n is hidden in _7„ = (2/i — l)n). Thus 



2m 



1 2m 



m=0 
\2fh 



ml 



m! 



rn! 



(6.58) 



From (6.57) and (6.58) we get 



m/2 



ml 



C2n-i/2+^|z|2. 



m! 



1/2 



and wc denote the r.h.s. by E{rh,n,z). We now integrate the coherent states 
over the displacements z as we did in the case of local asymptotic normality in 
order to obtain the thermal states in which we are interested 



■ \/27rs2 J 
If we define the evolved states 

Plnit) ■■= U,,n{t)PlnUj,nit)^ and 0"(i) := U{t)ct>-U{t)\ 
then supj.g^^ sup||„||<„^ l|p"„(^) - 4>'^{t)h is bounded by 

sup /e-l^-^^'^«"l'/2«'^(m,n,z)dV 

Here again we cut the integral in two parts. On |z| > n''"'"^, the Gaussian 
dominates, and this outer part is less than e~"''^\ The inner part is then 
dominated by sup|2.|<„,,+£ i?(m, n,z). Now on the one hand, we want (6.57) to 
be small, so that m cannot be too big. On the other hand, we want z^^/m! to 
go to zero. A choice which satisfies the condition is 7 = 2?7 + 3e. By renaming 
e, we then get 



E{m,n,z) = 0{n 



= rO('r,'7-V4+e ^3r/-l/2+e 



for any small enough e > 0. Hence we obtain (6.22). 
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Bundles with a Lift of 
Infinitesimal Diffeomorphisms 

This chapter is not about algebra and quantum mechanics, but about differential 
geometry and classical field theory. We slightly extend the notion of a natural 
fibre bundle by requiring diffeomorphisms of the base to lift to automorphisms 
of the bundle only infinitesimally, i.e. at the level of the Lie algebra of vector 
fields. Spin structures are natural only in this extended sense. We classify the 
fibre bundles with this property, under the additional assumption of a finite 
dimensional structure group. This includes all spin structures, but only some 
Spin^ and SpinG-structures. This classification links the gauge group G to the 
topology of space-time. 

Remark Many of the results in this chapter, including the main theorem 
7.14, can be found in Lecomte's paper [Lec85]. Our results were obtained inde- 
pendently. 

7.1 Introduction 

All bundles are equal, but some are more natural than others. For example, 

the naturality of tangent bundles, frame bundles and jet bundles is beyond all 
question. A bundle is called 'natural' if diffeomorphisms of the base lift to 
automorphisms of the bundle in a local fashion. 

Let us phrase this more carefully. Any principal fibre bundle tt : P — )• M 
determines an exact sequence of groups 

1 ^ rc(Ad(P)) ^ Autc(P) Difff (M) 1 . (7.1) 

In this expression, Autc(P) is the group of automorphisms of P that are trivial 
outside a compact subset of M. Its image Difff (M), the group of compactly 
supported liftable diffeomorphisms of M, is an open subgroup of Diffc(M). 
In particular, it is big enough to contain its connected component of unity. 
We have identified the gauge group of vertical automorphisms with the group 
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of sections of the adjoint bundle Ad(P), and rc(Ad(P)) is the subgroup of 
compactly supported ones. 

Definition 7.1 A 'natural' principal fibre bundle is a principal fibre bundle 
P for which all diffeomorphisms are liftable and (7.1) is split, together with 
a distinguished splitting homomorphism S : Diffc(M) — )• Autc(P). Moreover, 
the splitting homomorphism S is required to be local in the sense that it should 
lift each diffeomorphism 4> : Ui ^ between open subsets of M to a bundle 
isomorphism : -jt~^{Ui) 7r~^(J72) in a functorial fashion. 

Natural bundles have been classified. A theorem of Salvioli, Palais, Terng, Ep- 
stein and Thurston [Sal72, PT77, ET79] states that any natural fibre bundle is 
associated to the A:*^ order frame bundle P'^(M). 

In this chapter, we seek to extend the notion of a natural fibre bundle in 
two separate ways. First of all, we do not require locality, but prove it. And 
secondly, we only require diffeomorphisms of the base to lift to automorphisms 
of the bundle infinitesimally, i.e. at the level of Lie algebras. We will call the 
principal fibre b^indles which are natural in this extended sense 'infinitesimally 
natural', and our main objective will be their classification. 

Let us formulate this more precisely. For any principal fibre bundle, the 
sequence of groups (7.1) gives rise to the exact sequence of Lie algebras 

^ rc(ad(P)) T^{TPf re(TM) , (7.2) 

where 'c' stands for '0 outside a compact subset of M'. 

Definition 7.2 An 'infinitesimally natural' principal fibre bundle is a principal 
fibre bundle P, together with a distinguished Lie algebras homomorphism a : 
rc(TM) — >■ rc(TP)'^ that splits the exact sequence of Lie algebras (7.2). 

We emphasize that we will not require cr to be local, continuous or C°°{M)- 
linear, and it need not come from a map of bundles. We only require cr to be a 
homomorphism of Lie algebras. 

The price we pay for this level of generality is that we must confine ourselves 
to fibre bundles with a finite dimensional structure group. This allows us to first 
study principal fibre bundles, and then generalize the results to more general 
fibre bundles with a finite dimensional structure group, such as vector bundles. 

There are three main reasons for wanting to extend the notion of a natural 
bundle. First of all, natural bundles are meant to describe geometric objects, 
but not all geometry is local. One might feel that the locality requirement 
on S is therefore inappropriate. The universal cover M — > M for instance 
is not a natural bundle, although its ties with the global topology of M are 
unmistakable. It does however allow for a lift of infinitesimal diffeomorphisms, 
and is therefore infinitesimally natural. 

The second reason comes from field theory. Fields are described by sections 
of a fibre bundle F over space-time M, where F is associated to some principal 
fibre bundle P. Fermions, for example, are sections of a spinor bundle F, 
associated to a spin structure P. As spinors transform according to a projective 
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representation of the Lorentz group, they acquire a minus sign upon a full 
rotation. This means that there is no hope of lifting global diffeomorphisms, so 
that spinor bundles cannot be natural. One might feel however that it would 
be unfair to discard them as unnatural. After all, they do occur in nature, or 
at least in our mathematical description of it. As one can lift diffeomorphisms 
infinitesimally, i.e. at the level of vector fields, a spinor bundle does constitute 
an infinitesimally natural bundle. 

The third reason is that in field theory, one would like to construct a 
stress-energy-momentum tensor, corresponding to infinitesimal transformations 
of space-time M . Unfortunately, Diff(M) does not act on the fields T{E) di- 
rectly. Since Aut(P) does, we need an infinitesimal lift of Diff(Af) into Aut(_P), 
i.e. a Lie algebra homomorphism a that splits (7.2). This means that an in- 
finitesimally natural bundle is precisely what one needs in order to construct a 
SEM-tcnsor from Noethcr's principle. (Sec chapter 2, or e.g. [GM92, FR04].) 
The splitting a is to be interpreted as the transformation behaviour of the fields 
under infinitesimal space-time transformations. 

The outline of the chapter is as follows. Sections 7.2, 7.3 and 7.4 are devoted 
to the classification of infinitesimally natural principal fibre bundles. The central 
result is theorem 7.14, which states the following. 

Any infinitesimally natural principal fibre bundle is associated to the universal 
cover of the connected component of the k^^ order frame bundle F^'^{M). 

We extend this to fibre bundles with a finite dimensional structure group in 

section 7.5, with special attention for vector bundles. In section 7.6, we seek 
conditions under which a splitting of (7.2) gives rise to a flat connection. 

Finally, in section 7.7, we study spin structures in the presence of a gauge 
field. Ordinary spin structures are infinitesimally natural, as they carry a 
canonical splitting of (7.2). In the presence of a gauge group G however, one 
should consider SpinG-structures instead (see [HP 78], [AI80]). For example, if 
G = U{\), the appropriate bundles are Spin'^-structures. In contrast to ordi- 
nary spin structures, not all SpinG-structures are infinitesimally natural. We 
show that for a compact gauge group, infinitesimally natural SpinG-structures 
correspond precisely to homomorphisms 7ri(F(M)) — )• G that are injective on 
7ri(GL(K")). 

In a sense, this shows that comphcated manifolds call for complicated gauge 
groups. Some manifolds, such as CP^, do not allow for any infinitesimally nat- 
ural SpinG-structure at all. If one feels that fermions should have a well defined 
transformation behaviour under infinitesimal space-time transformations, then 
these manifolds are disqualified as models for space-time. 

7.2 Principal Bundles as Lie Algebra Extensions 

We seek to classify infinitesimally natural principal fibre bundles. It is to this 
end that we study Lie algebra homomorphisms a that split (7.2). In this section, 
we will prove that a must be a differential operator of finite order. 
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The first step, to be taken in section 7.2.1, is to show that maximal ideals in 

Tc{TM) correspond precisely to points in M. Using this, we will see in section 
7.2.2 that (J must be a local map. Once again, we stress that locality of a will 
be a theorem, not an assumption. We will then prove, in section 7.2.3, that a 
is in fact a differential operator. 

Before we proceed, let us have a closer look at the exact sequence of Lie 
algebras (7.2), derived from a smooth principal G-bundle tt : P — )• M. The last 
term, TdTM), is the Lie algebra of smooth, compactly supported vector fields 
on M. The middle term, TdTP)'^, is the Lie algebra of G-invariant vector 
fields f on P such that 7r(Supp(i')) is compact. The pushforward defines a Lie 
algebra homoniorphism tt* : TdTP)'^ — > Tc{TA4) because of G-invariance, and 
its kernel Tc{TP)^' is the ideal of vertical vector fields. It can be identified 
with rc(ad(P)), the Lie algebra of compactly supported sections of the adjoint 
bundle ad(P) := P Xq Q, where g is the Lie algebra of G. 

7.2.1 Ideals of the Lie Algebra of Vector Fields 

The following lemma, due to Shanks and Pursell [SP54] , constitutes the linchpin 
of the proof. It identifies the maximal ideals of Tc{TM) as points in M. The 
proof is taken from [SP54] , with perhaps some minor clarifications. 

Lemma 7.1 Let Tc(TM) be the Lie algebra of smooth compactly supported vec- 
tor fields on M. Then the maximal ideals of T(.(TM) are labelled by points q 
in M, Iq being the ideal of vector fields in Tc{TM) which are zero and flat 
at q. That is, Ig = {v G rc{TM)\v{q) = and (ad(t«i J . . . ad(wi„)w)(g) = 

0Vw;,,,...w,„,er,(rM)}. 

Proof. Let / C Tt.{TM) be an ideal. Suppose that there exists a point q £ M 
such that v{q) = for all v G I. Then I C 7^. Indeed, for all Wj^, . . . ,Wi^ G 
Tc{TM), one has (ad(u>jj . . . a.d{wi^)v){q) ~ because the l.h.s. is in I. 

Now suppose that / does not have such a point. We will prove that this 
implies I = Tf.{TM). Prom this, the lemma will follow. If / is a maximal 
ideal, then surely / 7^ rc(rM), so we must have I Q Iq. But it is clear from 
its definition that Iq is an ideal, so, / being maximal, we must have / = Iq. 
Conversely, the ideals Iq are all maximal, because any enveloping ideal must 
either be all of Tc{TM), or else be contained some Iq, which cannot be unless 
q = q. 

We are therefore left to prove the following: suppose that / is an ideal such 
that for all q G M, there exists a. v G I with v{q) ^ 0. Then I = rc(TM). 

We will shortly prove the following statement: for all q € M, there exists a 
neighbourhood Uq such that all w G Tc{TM) with Supp(ti;) C Uq can be written 
as w — [v, u] with v E I and u E TdTM). 

Such w are therefore elements of /. Membership of / then extends to arbi- 
trary w G FciTM) by covering its support with a finite amount of neighbour- 
hoods Uq-^ . . . Uqj^, for which we construct a partition of unity X^fLi /« = 1; 
Supp(/s) C Uq, . We may then write w = J2^=i fsW = ^^fLibsjWg] G I. This 
concludes the proof modulo lemma 7.2, which is next in line. □ 
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Although one can easily find u e Tc{TM) and v G I such that w = [u, v] 
locally, it is not always clear how to extend these to global vector fields while 
simultaneously satisfying w = [u,v]. For instance, with M the circle and 

V = w = dg, the solution u = Ode does not globally exist. We need to do some 
work in order to define a proper cutofi^ procedure. 

Lemma 7.2 Let I he a maximal ideal of Tc(TM) containing a vector field 
which does not vanish at q € M. Then q has a neighbourhood Ug such that 
all w e Tc{TM) with Supp(u') C Ug can be written w = [v,u] with v G I and 
u e Tc{TM). 

Proof. Choose v G I with v{q) ^ 0. There exist local co-ordinates xi,. . . ,Xn 
and an open neighbourhood W of q such that v\w = di. Choose W to be 
a block centered around q, and nest two smaller blocks (in local co-ordinates) 
inside, so that W D V D U . 

We take W = (-£,e)", V = (-|e, |e)", and U = |£)". Choose a 

smooth function g on M such that g\u = xi and g\M\v = 0- Require also that 
dig{x) = for i ^ 1 and {x2, ■ ■ ■ , Xn) S (— |e, |£)"~^. Define h := dig, and set 

V := [v,gdi\ = hdi. Then v\u = di, v\m\v = 0, and most importantly v € I. 

Now let w G Tc{TM) with Supp(w) C U. We will find a global vector field 
u e TciTM) that realizes [v,u] = w. 

For i 1, the i^^ component of the above reads hdiu^ = w"^ . In the region 
a;i < |e, we set . . . , a;„) = J_]^w^(t,X2 ■ ■ ■ ,Xn)<^t. On t/, where {; = 9i, 

we then have diu^ = and therefore = w'^. This is obviously also correct 

for points outside U with xi < ^e, as both and are zero. 

For |e < cci < |£, where w = 0, we let u^{xi,X2, ■ ■ ■ ,Xn) be a constant 
function of Xi, so that u'^{x-i,X2, ■ ■ ■ , = X2,---, Xn). We then have 

. . . , .x„) = 0, guaranteeing hdiu^ = w^. Note that this does not effect 
the smoothness of u*. 

Finally, for |£ < xi < e, let tend to zero, and let be zero for xi > e. 
This can be done in such a way that remains smooth. Since both h and w 
are zero, we have hd\u^ = on all of M. 

The case i = 1 is handled similarly, the only difference being that the first 
component of = w is now hd\v} — '^jU^djh = w^, the term u^dih of 

which cannot be dispensed with. We have arranged for w*(x), with i ^ 0, to be 
zero if {x2, ■ ■ ■ ^ (— |£)"~^, so that dih equals zero if u' does not. This 
leaves us with hdiv} — v}dih = w^. 

For Xi < ^e, one once again sets u^{xi, . . . , a;„) = w^{t, X2 ■ ■ ■ , x„)di. 
For g£ < xi < |£ however, one now has to define . . . , Xn) = h{xi, . . . , Xn) 

u'(|£, X2,. . . ,x„) in order for hd\v} — u^dih = vu^ to hold. Since this renders 

zero on the boundary of V, one is then free to define ?i* to be zero on M\V. 

Thus, if Supp(i(;) C U, we see that w = [v,u] with v G I and u € Tc{TM). 
This concludes the proof lemma 7.2, and thereby that of lemma 7.1. □ 

A maximal subalgebra yl of a Lie algebra L is either self-normalizing or ideal. 
Indeed it is contained in its normalizcr, which therefore (T[iials either A or L. 
A theorem of Barnes [Bar62j states that a finite-dinieiisional Lie algebra is 
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nilpotent if and only if^ every maximal subalgebra is an ideal. On the other 
extreme: 

Proposition 7.3 Let L he a Lie algebra over a field K, and let S he the set of 
subspaces A C L such that A is both an ideal and a maximal subalgebra. Then 

[L,L] = fl A. 

AGS 

In particular, since the r.h.s reads 'L' if S = $, L is perfect {[L,L] = L) if and 
only if every maximal subalgebra is self -normalizing. 

Proof. Let X ^ {L,L]. Then choose C A C L where A has codimension 

1 in L, and X ^ A. A is an ideal max;imal subalgebra, which does not contain 
X. Thus Hags^C 

Let A be an ideal maximal subalgebra, and X ^ A. Then A + KX is 
a subalgebra strictly containing A, so that it must equal L. Thus [L,L] = 
[A + KX, A + KX] C A, whence [L, L] C f]^^^ A. □ 

As a corollary, we have the following well known statement: 
Corollary 7.4 The Lie algebra Fc (TM) is perfect; 

[T,{TM),T,{TM)] =T,{TM) . 

Proof. According to lemma 7.1, the maximal ideals are precisely the ideals Iq 
of vector fields in Tc{TM) which are zero and flat at q. Iq is strictly contained 

in the subalgebra Ag of vector fields which are zero at q, so that no ideal is a 
maximal subalgebra. So every maximal subalgebra is self-normalizing. □ 

7.2.2 The Splitting as a Local Map 

With the main technical obstacles out of the way, we turn our attention to the 
sequence (7.2). We now prove that ct is a local map. 

Lemma 7.5 Let P ~^ M be a principal G-bundle over M . with G any Lie 
group. Let a : rc{TM) — > rc(TP)'^ be a Lie algebra homomorphism splitting 
the exact sequence of Lie-algebras 

^ rc(TP)f ^ TciTP)^ TciTM) . (7.3) 

Then a is local in the sense that 7r(Supp(o-(i)))) C Supp(t;). 

Proof. Any principal fibre bundle possesses an equivariant connection 1-form 
oj £ Q^{P,q), which enables one to lift vector fields. More precisely, the lifting 
map 7 : TciTM) Tc{TP)'^ is defined by the requirement that ujp{'y{Vjj.^p-))) 
equal zero for all p G P, and that tt* o 7 be the identity. The lifting map 7 splits 

^Actually, the 'only if part in Barnes' theorem is not written down in [Baj62], but this is 
immediately clear from the proof of Engel's theorem. (See e.g. [Hum72]). 
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the exact sequence (7.3) as a sequence of C°°(M)-modules, but generally not 
as a sequence of Lie algebras, since it need not be a homomorphism. 

Define / := cr-7. Then 7r*o / = 0, so that / is a map Tc{TM) Tc{TP)^. 
Since 7 is local by definition, the lemma will follow if we show / to be local. 

The homomorphism property of a, [a{v),a{'w)] — a{[v, w]) = 0, translates to 

fi[v,w])-[fiv)J{w)]= ^^^^ 
[-yiv], f{w)] - [yiw), f{v)] + [7(1^), 7(^)1 - 7(b, w])- 

The G-action effects a Lie algebra isomorphism TdTP)'^ ~ rc(ad(P)). From 
that perspective, we have for s G rc(ad(P)) that ['y{v),s] = V„s, the covariant 
derivative along v. The curvature of the connection is then given by R{v, w) = 
['y{v),'y{w)] — 7([t;, w]). We rewrite equation (7.4) as 

f{[v, w]) - [f{v), f{w)] = VJ{w) - V^fiv) + R{v, w) . (7.5) 

Pick m e M, and identify the fibre of ad(P) over m with the structure Lie 
algebra g. The restriction r„i '■ rc(ad(P)) — >■ g is a homomorphism of Lie 
algebras. Now define ■= {v S rc{TM) \ v{m) = 0} to be the maximal 
subalgebra of vector fields that vanish at m, and consider the 
defined by = o /|^^. Equation (7.5) transforms into 

fm{[vM) - [fm{v)Jm{w)] = V,(/(w))„ - V^{f{v))m + R{v,w)m , (7.6) 

the r.h.s. of which vanishes because V and R are C°° (M)-linear in v and w, 

and v{m) = w{m) = 0. This means that fm is a Lie algebra homomorphism. 

Let us restrict fm even further to the ideal rc(T(M — {m})) C Am (which 
is non- maximal) , and note that : rc{T{M — {m})) — )• is a homomorphism. 
Its kernel Kcr(/m) is therefore an ideal in Tc{T{M — {m})), and one of finite 
codimension at that. Indeed, 

r,{T{M - {m}))/Ker(/„) ~ Im(/„) C g . 

According to lemma 7.1 however, all proper ideals arc of infinite codimension, 
forcing Ker(/TO) = Tc{T{M — {m})). But the vanishing of fm for all m G M is 
tantamount to locality of / in the sense that Supp(/(w)) C Supp(t;). □ 

We gather some notation that we introduced in the course of the proof. If 

a : Tc{TM) — >■ Tc{TP)'^ is a Lie algebra homomorphism splitting tt* and 7 : 
Tc{TM) — >■ Tc{TP)'^ is the lift induced by an equivariant connection, then we 
define / := tr — 7. It maps to the vertical vector fields, identified with r(ad(P)). 
Wc define Am to be the maximal subalgebra of vector fields that vanish at 
TO, and fm '■ Am — >■ to be the restriction of / to Am, followed by the map 
r(ad(P)) — )• g which picks out the fibre over m and identifies it with q. 

7.2.3 The Splitting as a Differential Operator 

In this section, wc will prove that o" is a differential operator of finite order. 
Since / is local, it defines a map from the sheaf of smooth sections of TM to 
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the sheaf of smooth sections of ad(P). An elegant theorem of Peetre ([Pee60]) 
then says that /, and therefore cr, must be a differential operator of locally finite 
order. All we need to do is find a global bound on the order, which will occupy 
us for the remainder of the section. 

The fact that / is a differential operator of locally finite order means that 
for each m G M, there exists an r such that H^^ — {v £ Am Ij^i"^) = 0} 
is contained in ker(/m), where jm(v) is the r-jet of v at m. Consequently, 
factors through the jet Lie algebra J^^iTM) := Am/H^. Since we do not know 
r, we define 

Vec„ := lim Am/ H!^ , 

and remark that fm induces a homomorphism Vec„ g. The Lie algebra Vec„ 
depends on M only through its dimension n. 

Local co-ordinates provide one with a basis x"3j, where is shorthand for 
x"^ . . . x"". With Vec^ = Span{a;"9i | |a| = fc + 1 , i = 1 . . . n}, one may write 

oo 

Vec„ = 0Vect 

k=0 

That is, each element of Vec„ can be uniquely written as a finite sum of 
homogeneous vector fields. Note that Vecj!j is the fc-eigenspace of the Euler 
vector field E :— X)r=i^*^«- ^ ideal containing v = 'Yli^^a'^k, then 

&d{Eyv = J2k=o ^'''^k G I for all j, so that Vk G /• Thus any ideal splits into 
homogeneous components 

7 = 07'= 

with 7*^ = 7 n Vec^. This renders the ideal structure of Vec„ more or less 
tractable, so that we may prove the following bound on the order of fm- 

Lemma 7.6 The order of the differential operator a is at most dim(g) unless 
dim(M) = 1 and dim(0) = 2, in which case the order is at most 3. 

Proof. We closely follow Epstein and Thurston, [ET79]. One checks by hand 
that the only ideals of Veci = Spanjaj^'^ \k > 1} are Span{a;^9, x'^d | > 4} 
and Span{x''d \k>N} with N > 1. 

Consider Veci as a subalgebra of Vec„, define K to be the kernel of fm, and 
let Ki := K (1 Veci. We then have injective homomorphisms 

Veci/Ki M- Yecn/K M- g , 

so that dim(Vcci/7iri) < dim(g). As Ki is an ideal, it must be of the shape 
mentioned above. This leads us to conclude that x^di G K for all k > dim(0) 
unless dim(g) = 2, in which case x'^di E K for all fc > 3, and x\di E K. 

The following short calculation shows that if dim(g) = 2 and dim(M) > 1, 
then also x\di G K. As K contains x^di, it also contains [xldi,xid2] — x\d2, 
and thus [x\d2TXiX2d-i\ ~ x\di — 2x\x2d2- But by bracketing with x\d2 and 
X2di respectively, we see that x\di — 2)x\x2d2 is in K, ergo x\di G K. 
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The next step is to show that if xfdi € K, then K also contains all x"di 
with \d\ = s. First of all, we remain in K if we repeatedly apply ad^Xidi) to 
x^di, to the effect of replacing xi by Xi up to a nonzero factor. This shows 
that € K. Then the relation x"di = [x"di,xidi] + XidiX"di transfers 

membership of K from right to left. 

In the generic case dim(g) 2, dim(M) ^ 1, we may conclude that the 
order of a is at most dim(fl), because Hm"^^^^ c K. In the exceptional case that 
dim(0) = 2 and dim(M) = 1, the order of a is at most 3. □ 

In particular, cr is a differential operator of finite rather than locally finite order. 

Let us summarize our progress so far. 

Proposition 7.7 Let P he an infinitesimally natural principal G -bundle. Then 
u : rc(rM) — > rc(TP)'^ factors through the bundle of k-jets, where k = 3 if 
dim(M) ~ I, dim(g) = 2 and k — dim(g) otherwise. If we identify equivariant 
vector fields on P with sections of the Atiyah bundle TP/G, we can therefore 
define a bundle map V : J^{TM) TP/G by VO'i(w)) := It makes 

the following diagram commute. 

V{TM) — ^ T{TP/G) 

V 

T{J^{TM)) 



The point is that although a is defined only on sections, V comes from a ver- 
itable bundle map J^{TM) TP/G. Note that although a was only defined 
on Vc{TM), it extends to T{TM) by locality. 



7.3 Lie Groupoids and Algebroids of Jets 

The bundles J'^{TM) and TP/G arc Lie algebroids, and it will be essential for 
us to prove that V : .J^{TM) TP/G is a homomorphism of Lie algebroids. 
In order to do this, we will first have a closer look at [TM) and TP/G, and 
at their corresponding Lie groupoids. 

Let us first set some notation. The jet group Gq o(I^") is the group of /c-jets 
of diffeomorphisms of M" that fix 0. It is the semi-direct product of GL(M") 
and the connected, simply connected, unipotent Lie group of fc-jets that equal 
the identity to first order. 

The subgroup Go_q(M") of orientation preserving A:-jets is connected, but 
not simply connected. As Gq o(lR") retracts to SO(R"), its homotopy group is 
isomorphic to {1} if n = 1, to Z if n = 2, and to Z/2Z if n > 2. For brevity, we 
introduce the following notation. 

Definition 7.3 If k > 0, we denote tti(G^'^{R")) by Z. 

Thus for n>2, the universal cover Gjg(M") G^^(M") is 2 : 1, and restricts 
to the spin group over SO(M"). 
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7.3.1 The Lie Groupoid of k-3ets 

In this section, we define the Lie groupoid G'^{M) of fc-jets, its maximal source- 
connected Lie subgroupoid G^^{M), and the k^^ order frame bundle F^{M). 

Denote by Gj^, ^{M) the manifold of fc-jets at m of diffeomorphisms of M 
which map m to m' , and denote by G''{M) = UmxmG^, ^{M) the groupoid 
of fc-jets. If is a fc-jet at m of a diffeomorphism a, then its source is 

s(j^(a)) = m, its target is t{j^{a)) = a{m), and multiplication is given by 
concatenation. 

Denote by the manifold s^^(m) of fc-jets with source m. The target 

map t : G^ ^{M) — )■ M endows it with a structure of principal fibre bundle, 
the structure group G^„(M) ~ G§g(K") acting freely and transitively on 
the right. As G^(M)*_m is isomorphic to the frame bundle F{M), one calls 
G'^{M)^,,rn the fc**^ order frame bundle, sometimes denoted F^{M). 

Lemma 7.8 Let M he connected and let k > 1. Then G''{M) is source- 
connected if and only if M is not orientable. 

Proof. We may as well consider fc = 1, because the fibres of G'^{M) G^{M) 
can be contracted. Each source fibre G^{M)^^m of G^(M) is isomorphic to the 
frame bundle. By definition, M is oriented precisely when the frames can be 
grouped into positively and negatively oriented ones. □ 

Definition 7.4 We define G~^''{M) to be the maximal source- connected Lie sub- 
groupoid ofG^(M). and denote its source fibre hy F'^^{M). 

In the light of the previous lemma, this means that G^'' {M) is the Lie groupoid 
of fc-jets of orientation preserving diffeomorphisms if M is orientable, and simply 
G'=(M) if M is not. 

Note that the map D : Diff(M) Diff (G'=(M)) defined by Da : ^ 
jmic^l) is a homomorphism of groups. We will call it the fc**^ order derivative. 
Because Da is source-preserving and right invariant, it defines a homomorphism 
Difr(Af) Aut'^™.'-(^'^)(Gj „(M)), splitting the exact sequence of groups (7.1). 
This makes Gj_„(M) = F''{M) into a natural bundle. Note that F+'=(M) is 
infinitesimally natural. 

7.3.2 The Lie Algebroid of A -Jets 

The bundle J^{TM) possesses a structure of Lie algebroid, induced by the Lie 
groupoid G''{M). We now describe the Lie bracket on V{J'^{TM)) explicitly. 
Later, in section 7.3.4, we will use this to show that V is a Lie algebroid homo- 
morphism. 

The Lie algebroid of G^(M) is a vector bundle A — >■ M. Its fibre Am is 

by definition the subspace of the tangent space of G^{M) at jm(id) which is 
annihilated by ds. Sections of A therefore correspond to right-invariant vector 
fields on G^{M) parallel to the source fibres. 

Each curve in Gj m{-^) through j^(id) takes the shape c{t) = jmio^t) with 
ao = id, so that its tangent vector a E A,n takes the shape a = jmi'^)^ with 
v = dt\oaf This shows that A ~ J''{TM). 
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The anchor dt : J'^{TM) TM is easily seen to be the canonical projection, 
so we shall denote it by tt. The Lie bracket on r( j''{TM)) however, which is 
defined as the restriction of the commutator bracket on r(rG'^(M)) to the right 
invariant source preserving vector fields, perhaps deserves some comment. 

Define J'^ °(rM) to be the kernel of tt, and consider the exact sequence of 
Lie algebras 

^ r(j'='°(rM)) ^ T{j''{TM)) A r{TM) -s> o. 

It is split by j'' : T{TM) r(j'=(rM)), the infinitesimal version of the k^^ 
order derivative. (The sequence of Lie algebroids of course does not split, as 
differentiation is not linear over C°°{M).) 

We will describe the Lie bracket on T{J''{TM)) by giving it on T{TM) and 
r(,/''-"(TA/)) separately, and then giving the action of r(rAf) on T{j''-'-\TM)). 

Proposition 7.9 Let u and u' be sections ofTM, and letr : Jmi'^m) o,nd 
t' :m<-^ im(^m) be sections of J^'°{TM). Then 

[j''iu),T]m = jU[u,Vm])+jt{du\m{x^V:,)), 

where du\mix i— )■ v^) is the ordinary derivative at m along u of a map from M 
to r(TM). Although both terms on the right hand side depend on the choice of 
m>-^ Vm, their sum does not. 

Proof. The first equality is clear, as j'^ is a homomorphism of Lie algebras. 
The second equality can be seen as follows. Consider the bundle of groups 
G^{M)^,^ := € G^{Al)\a{m) = m}, with bundle map s = t. Its 

sections r(G'^(M),^,) form a group under pointwise multiplication, the Lie al- 
gebra of which is r( J*='°(TM)), with the pointwise bracket. As r(G'=(M)*,*) 
acts from the left on G^{A1) by .7,^^(7) ^ imi'^m) ° Jm(7)> respecting both the 
source map and right multiplication, the inclusion r(j'='°(TM)) T{J^{TM)) 
is a homomorphism of Lie algebras. This proves the second line. 

To verify the third line, we must choose a smooth map x ^ from M to 
r(TM) such that Tx = j^i'^x) in a neighbourhood of m. Each Vx necessarily 
has a zero at x. If we denote m(s) := ex.p{su)m, then the bracket [j''{u),T]m 
is by definition^ minus the mixed second derivative along s and f at of the 
groupoid commutator 

im(s)(exp(-su))j^(,)(exp(-to„(,)))j^(exp(sw))j^(exp(to„)), 

which is just (exp(— su) exp{—tVm{s)) exp(su) exp(to„i)) . The terms not in- 
volving derivatives of s >-)■ m(s) yield ^^([w, ^to]), and the terms which do 
provide the extra jm{du\m{x fx))- 1^ 

^The Lie algebra of the diffeomorphism group is the Lie algebra of vector fields, but the 

exponential map is given by v h-> cxp(— u), where exp denotes the unit flow along v. This is 
why the groupoid commutator might seem odd at first sight. 
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7.3.3 The Gauge Groupoid and its Algebroid 

Given a principal G-bundle n : P ^ M, one can define the gauge groupoid 

(P X P) /G, that is the pair groupoid modded out by the diagonal action. Source 
and target come from projection on the second and first term respectively, 
M ^ {P X P)/G as id^(p) = and multiplication is well defined by 

[(r, g)] o [(q^p)] = [(r,p)]. An element [{q,p)] corresponds precisely to a G- 
equivariant diffeomorphism w^^{p) — > Tr^^{q), and the product to concatenation 
of maps. 

Its Lie algebroid TP/G is sometimes called the Atiyah algebroid. Indeed, the 
subspace of Tii^{[P x P)/G) which annihilates ds is canonically {TP/G)m- A 
section of TP/ G can be identified with a G-equivariant section of TP, endowing 
T{TP/G) with the Lie bracket that comes from T{TP)^. 

7.3.4 The SpUtting as a Homomorphism of Lie Algebroids 

The point of considering the Lie algebroid structure of J'^ (TM) was of course 
to prove the following. 

Lemma 7.10 The map V : J^{TM) TP/G is a homomorphism of Lie 
algebroids. 

Proof. The fact that V respects the anchor is immediate. As tt* o V(j^ («,„)) = 
TT, o (j{vm)(jn) = Vm{m), it equals tt o j^{vm) in the point m. 

We now show that V : T{J^{TM)) T{TP/G) is a homomorphism of 
Lie algebras. First of all, the restriction of V to j''{T{TM)) is a homomor- 
phism. Indeed, [V(j'^(t))), V(j*'(w))] = [a{v),a{w)], which equals (t([?;,w;]) 
because cr is a homomorphism. This in turn is just 'V{j'^{[v,'w])), so that 

[V(/W),v(/H)] = v([,7'=H,j'=H]). 

Secondly, its restriction to T{J''^°{TM)) is a homomorphism. If = .7* ( 
and Vx = jxiwx) are sections of J'^'°{TM), then Vr and Vv are in the kernel of 
the anchor. This implies that their commutator at a certain point m depends 
only on their values at m, not on their derivatives. To find the commutator at 
m, we may therefore replace j!^{vx) by j^(wm) and Hkewise j^{wx) by j^{wm)- 
We then see that [Vj^(w^), Vj^(w;j;)]™ = [Vj^(w™), Vj^(w™)]™. We already 
know that this is '\/{jl^{[vm,Wm]))m, so that [V(t), V(u)]„ = V([r,w])„. 

The last step is to show that V respects the bracket between j''(r{TM)) and 
r(j'^'°(TM)). Again, let j'^{v) be an element of the former and Tj. = jl^{wx) of 
the latter. Considered as an equivariant vector field on P, the vertical vec- 
tor field V(r) takes the value ap(w^(p)) at p e P. Then [V(j''(w)), V(r)] 
is the Lie derivative along a{v) of the vertical vector field f7p(w^(p)). Dif- 
ferentiating along a{v) is done by considering {p,p') fTp(w^(p/)), differen- 
tiating w.r.t. p and p' separately, and then putting p = p'. This results in 
[CT(w),crp(u;^(p))]po = [a{v),a{w^(^p„))]p„ + a{dy\^(^pg){x ^ w^)), which is in turn 
the same as C7pg{[v,w^,^p„)] +d„U(p„)(m ^ w^))), so that [V(i'=(t;)), V(t)] = 
r]) as required. Therefore V must be a homomorphism on all of 
T{J''{TM)). □ 
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Definition 7.5 A connection V of a Lie algehroid A on a vector bundle E is by 

definition a bundle map of A into DO^{E), the first order differential operators 
on E, which respects the anchor. If moreover it is a morphism of Lie algebroids, 
then the connection is called flat. A flat connection of A on E is also called a 
representation of A on E. 

This explains our notation for the map V induced by a. Given a representa- 
tion V of G, one may form the associated vector bundle E := P xq V. The 
map V then defines a Lie algebroid homomorphism of r(j'^(TM)) into the Lie 
algebroid of first order differential operators on E. (Simply consider a section 
of -E as a G-equivariant function P ^ V, and let r'^(TP) act by Lie deriva- 
tive.) By definition, this is a flat connection, or equivalently a Lie algebroid 
representation. 

7.4 The Classification Theorem 

We use the fact that V : T{ J''{TM)) T{TP/G) is a honioniorphism of Lie 
algebroids to find a corresponding homomorphism of Lie groupoids. This will 
give us the desired classification of infinitesimally natural principal fibre bundles. 

7.4.1 Integrating a Homomorphism of Lie Algebroids 

The following theorem states that homomorphisms of Lie algebroids induce ho- 
momorphisms of Lie groupoids if the initial groupoid is source-simply connected. 

Theorem 7.11 (Lie II for algebroids) Let G and H be Lie groupoids, with 
corresponding Lie algebroids A and B respectively. Let V : A — > _B be a homo- 
morphism of Lie algebroids. If G is source-simply connected, then there exists 
a unique homomorphism G ^ H of Lie groupoids which integrates V. 

Remark The result was probably announced first in [Pra66] , and proofs have 
appeared e.g. in [MXOO] and [MM02]. We follow [CF03], which the reader may 
consult for details. 

Sketch of proof. The idea is that V allows one to lift piecewise smooth paths 

of constant source in G to piecewise smooth paths of constant source in H. 
Source-preserving piecewise smooth homotopies in G of course do not affect 
the endpoint of the path in H , so that, if G is source-simply connected, one 
obtains a map G —?' H hy identifying elements g of G with equivalence classes 
of source preserving paths from ids(g) to g. One checks that this is the unique 
homomorphism of Lie groupoids integrating V. □ 

Unfortunately, G^{M) is not always source connected, let alone source-simply 
connected. Recall that G'+*'(M) is the maximal source-connected Lie sub- 
groupoid of G^{M), and therefore has the same Lie algebroid j''{TM). 

We define G+'^(M) to be the set of piecewise smooth, source preserving 
paths in G+'^(M) beginning at an identity, modulo piecewise smooth, source 
preserving homotopies. It is a smooth manifold because G+'^(M) is, and a Lie 
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groupoid under the unique structure making the projection on the endpoint 
G+'^(M) — > G'^^{M) into a morphism of groupoids. Explicitly, the multiplica- 
tion is given as follows. If g{t) a path from id^ to g{^)m'mj and h{t) a path 
from idm,' to h{l)m"m', then the product [h] o [g] is [{h ■ g{l)) * g], where the 
dot denotes groupoid multiplication and the star concatenation of paths. The 
proof of associativity is the usual one. 

Note that the source fibre G'^''{M)^,^m is precisely the universal cover of the 
connected component of the k^^ order frame bimdlc G+'^(Af)*,„i = F+'^(M). In 
order to cut down on the subscripts, we introduce new notation for G~^^ (^Af^^^j^i 
and its structure group G~^''{M)^^^. 

Definition 7.6 We denote thejaniversal cover of the connected component of 
the k^^ order frame bundle by F+'^(M), and its structure group by G{k,M). 

It is an infinitcsimally natural bundle because F^''(M) is. Note that G{k,M) 
is not the universal cover of G^^{M), but rather its extension by tti{F''{M)). 
As 7ri(F'^(M)) — ni{F{M)), we have the exact sequence of groups 

1 ^ 7ri(F(M)) ^ G(fc, M) ^ G+%{M) ^ 1 . 
The group G^„^{M) in turn is isomorphic to Gg o(K") if M is orientable, and 

^0,0 1 



to G§o(K") if it is not. 



7.4.2 Claissification 

Now that we've found a sourcc-simply connected Lie groupoid with J^{TM) as 
Lie algebroid, we can finally apply Lie's second theorem for algebroids to obtain 
the following. 

Proposition 7.12 If a splits the exact sequence of Lie algebras ('7.2), then it 
induces a morphism of groupoids expV : G+*' — )• {P x P)/G such that the 
following diagram commutes, with exp^ the flow along a vector field starting at 
idm- 

exp„ 
^ V{TP/G) — 




Proof. As G+*'(M) is a source-simply connected Lie groupoid with j'^iTM) as 
Lie algebroid, we can apply Lie's second theorem for algebroids. □ 

It is perhaps worth wile to formulate this for general transitive Lie groupoids, 
as it clarifies the link with the recent work of Grabowski, Kotov and Poncin 
[GKP09]. For the Atiyah algebroid A of a principal fibre bundle with a con- 
nected, reductive structure group, they classify Lie algebra isomorphisms of 
V{A) in terms of the Lie algebroid isomorphisms of A. 
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Proposition 7.13 Let Q ^ M he a transitive Lie groupoid, with Lie algebroid 
A. The kernel of the anchor K is then a bundle of Lie algebras with fixed 
dimension d. Suppose that the sequence 

r{K) -s> r{A) -> r{TM) o , 

with K the kernel of the anchor, splits as a sequence of Lie algebras. Then this 
splitting is induced by a morphism of Lie algebroids V : J^(TM) — )■ A, and 
there is a corresponding morphism of Lie groupoids G~^''{M) Q. The number 
k is at most 3 if d is 2 and dim(M) = 1, and at most d otherwise. 



Proof. (Or rather a flimsy sketch thereof.) Analogous to the case of the gauge 
groupoid. □ 

We have paved the way for a classiflcation of inflnitesimally natural principal 

fibre bundles. 

Theorem 7.14 Let n : P ^ M be an inflnitesimally natural principal G- 
bundle with splitting a of (7.2). Then there exists a group homomorphism p : 
G{k,M) G such that the bundle P is associated to F^''{M) through p, i.e. 

P-:^F+^{M) XpG. 

Moreover, a is induced by the canonical one for F^^ (M) . 



Proof. Fix a base point m on M. The map cxp V yields a homomorphism of 
groups p : G^^{M) — >• ((Px P)/G)m,mi the latter isomorphic to G, the former 
io G{k,M). 

The map G+1(M) Xp ((P x P)/G)„,,„, ^ ((P x P)/G),,„,, which is given by 

{dm' ,mtPm,m 

) {exp'V {gm' ,m)) ■ Pm,m IS well defined and injective because two 
pairs share the same image if and only if they are equivalent modulo G+'^„(M). 
It is also surjective and G-equi variant, and hence an isomorphism of principal 
G-bundles. As ((P x P)/G),,„ ~ P and G+^(M) x^ ((P x P)/G)„,„ ~ 
F+^(M) XpG, the equivalence is proven. The remark on a follows from the 
construction. □ 

This classifies the infinitesimally natural principal fibre bundles. They are all 
associated (via a group homomorphism) to the bundle Gt,m — F^'^{M). 

The classiflcation of natural principal flbre bundles is now an easy corollary. 
The following well known result ([PT77], [Ter78]) states that they are precisely 

the ones associated to G'; „^{M) = F''{M). 

Corollary 7.15 Let n : P M be a natural principal G-bundle with local 
splitting E of (7.1). Then P is associated to F'^{M). That is, there exists a 
homomorphism p : Gq q{W^) — )• G such that 

P-:^F^{M) XpG. 

Moreover, S is induced by the canonical one for F'' (M) . 
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Proof. As the homomorphism E : Diffc(M) — >■ Autc(P) is local, it induces a 
homomorphism of groupoids E : Gcrm(M) ~^ {P x P) /G, with Gcrni(Af) the 
groupoid of germs of diffeomorphisms of M. We need but show that E factors 
through j'' : Germ(M) G^{M) for some A; > 0, cf. the proof of theorem 7.14. 

The Lie algebra homomorphism a : Tc{TM) TP/G defined by (7{v) := 
9t|oE(exp(to)) is local by assumption, and according to proposition 7.7 it factors 
through the /c-jets for some fc > 0. It suffices to show that T,{(j))m,m = idm,m 
for any (p G Gcrm„j,„i(M) that agrees with the identity to /c*^ order at m. 

In local co-ordinates {x'}, we write = + w*(x), where v : M" — >■ 

M" vanishes to k^^ order. We define the one parameter family of germs of 
diffeomorphisms (pKx) .t* + tv'^(x). Then dt\T^((f)T)^^^{4>t)m = 0, as it 
equals amidtlr'pT^'Pt), the image of a vector field that vanishes to order k at m. 
Therefore 1 1-)- ^{(f)t)m,m is constant, and T,{^)m,m = idm,m as required. □ 

To summarize: natural principal fibre bundles are associated to a higher frame 
bundle, whereas infinitesimally natural principal fibre bundles are associated to 
the universal cover of a higher frame bundle. 

7.4.3 The Bundle F+^{M) 

The above considerations prompt a few remarks on the universal cover of the 
connected component of the frame bundle F+'°(M), and on its (disconnected) 
structure group G{k,M). Recall that they are just the source fibre (5+^(M) 
and isotropy group G+^„(M) of (5+'=(M). 



General Manifolds 

If TTi (M) is the homotopy groupoid of M, define the homomorphism of groupoids 
Pr : ■!Ti{M)m',m T^o{Gm' ,miM)) by lifting a path in M to a path in G^{M) 
with fixed source, and taking the connected component of its end point. It 
makes 

7ri(M)m',m 

G+^,™(M) 7ro(Gt,„(M)) 

into a commutative diagram. 

Define {G^{M) x ■nx{M)f'' to be the groupoid of pairs (.g, [/]) such that 
7ro(5) = Pr([/]). If M is orientable, this is simply G+'=(M) x 7ri(M). The map 
of groupoids {&{M) x 7ri(M))P'' is well defined and surjective. It 

restricts to a covering map of principal fibre bundles 

r : G+1(M) ^ {G\M) x TriCM))^;^ . (7.7) 

The kernel of the corresponding cover of groups is precisely «*7ri(G+'^^(M)), 
with % : G+'^„j(M) G^^-JyM) the inclusion. Note that has a nonzero 
kernel precisely when a vertical loop is contractible in G+^(M), but not by a 
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homotopy which stays inside the fibre. Denoting 7ri(G+'^„(M)) by Z, we obtain 
the exact sequence 

1 ^ Z/Ker(u) h G+%{M) A (G"=(M) x 7ri(M))^^,„ ^ 1 . (7.8) 

A moment's thought reveals that this extension is central: if g{t) is a path in 
G+%{M) and h{t) one in G+^(M), then both h*{iog) and {iog)- h{l) * h 
can be homotoped into 1 h{t)g{t). 

We may as well restrict attention to the case fc = 1, in which ^(M) is the 
frame bundle F{M). Indeed, as G+^(M) -> GI'X{M) has contractible fibres, 
G+^(M) is just the pullback of G+^(M) along Gi;+ (M) ^ Gi;+ (M). 

Orientable Manifolds 

For orientable manifolds, the situation simplifies. If we identify the connected 
component of G^„(M) with GL'''(M"), we obtain a homomorphism of 
GL+(M") into G^+„(M). There is a second homomorphism 7ri(i^(M)) 
G^^(Af). Their images intersect in Z/Kcr(?'*), and commute by an argument 
similar to the one on centrality of (7.8). If we define (GL+(M") x 'r:i{F{M)))z 
to be the quotient of GL+(IR") x 7ri(F(M)) by the equivalence {gz, h) ~ {g, zh), 
we can regard it as a subgroup of G^^(M). Note that if Ker(i») is nonzero, 
the above equivalence relation sets it to 1. 

If M is orientable, we may restrict our attention to F+(M), which has 
connected fibres. Any path in F~^{M) which starts and ends in the same fibre 
can therefore be obtained by combining a closed loop^ith a path in GL^(M"). 
For orientable manifolds, we thus have Gl;+^{M) ~ (GL+(K") x 7ri(F+(M)))x, 
and in the same vein 

G{k,M) ~ (G(A;,M") x Tn{F+{M)))z. (7.9) 

Spin Manifolds 

Let M be an orientable manifold, equipped with a pseudo-Riemannian metric 
g of signature r? e Bil(M"). Then OF+ := {/ e F+{M) | f*g = r?} is the bundle 
of positively oriented orthogonal frames. A spin structure is then by definition 
an S0(r7)-bundle^ Q over M, plus a map u : Q ^ suchjhat the following 

diagram commutes, with k the canonical homomorphism SO(r?) SO(r/). A 
manifold is called spin if it admits a spin structure. 

^There is a subtlety here. Suppose 77 has indefinite signature, say (3, 1). The group SO(3, 1) 
has 2 connected components, so that a universal cover does not exist. As it is a subgroup 

of the simply connected group GL+(IR'l), we simply define SO{3, 1) to be k-1(SO(3, 1)) with 
K : GL+(R*) — > GL+(R*) the covering map. Thus SO(3,l) is, perhaps surprisingly, not 
isomorphic to the 2-component spin group Spin(3, 1). Indeed, if T is time inversion and P is 
the inversion of 3 space co- ordinates, then (Pr)2 = 1 in SO(3, 1), as opposed to {PT)^ = -1 
in Spin(3,l). Therefore 7r-i(±l) ~ Z/2Z x Z/2Z in SO(3, 1), whereas 7r-i(±l) = Z/4Z in 
Spin(3,l) (see [BW+01]). ^ 

Of course the connected component of unity of S0(3, 1) and that of Spin(3, 1) are both 
isomorphic to SL(C^), so that none of this is relevant if M is both orientable and time- 
orientable, i.e. if the structure group of the frame bundle reduces to SOl^(3, 1). 
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S0(7?) SO(r?) 




M 



Define Q := QXgQ, -GL+(K"), and let us again denote tiie induced map 
Q F+{M) by u. As any cover of F+(M) by a GL+(M")-bundle can be ob- 
tained in this way, there is a 1:1 correspondence between spin covers of OF^{M) 
and F~^(M). In particular, whether or not M is spin does not depend on the 
metric. 

The Serre spectral sequence gives rise to the exact sequence 

1 ^ Z/Kei{u) ni{F+{M)) ni{M) 1 . (7.10) 
The following proposition is well known. 

Proposition 7.16 A spin structure exists if and only if i^, : Z ^ tti{F~^{M)) 
is injective and (7.10) splits as a sequence of groups. If spin structures exist, 
then equivalence classes of spin covers correspond to splittings of (7.10). 

Proof. This will follow from theorem 7.23 later on. but see e.g. [MorOl] for an 
independent proof. Our criterion for M to be spin is equivalent to the vanishing 
of the second Stiefel- Whitney class, see e.g. [LM94]. □ 

Remark In terms of group cohomology, one can consider the sequence (7.10) 
as an element [uj] G i/^(7ri(M), Z/Ker(i*)). Spin bundles exist if and only if 
both Ker(i*) and [u] are trivial, in which case they are indexed by H^{'jti{M), Z). 

If a spin structure exists, then F^ is simply the puUback along the universal 
cover M ^ M oi Q ^ M. The picture then becomes 



F+^{M) > u^{F+''{M)) > F+^{M) 



F+{M) 



Q 



F+{M) 



M > M 

with each of the three squares a pullback square. 



7.5 More General Fibre Bundles 

In this section, we will prove a version of theorem 7.14 for fibre bundles which 
are not principal. It would however be overly optimistic to expect an analogue 
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of of theorem 7.14 to hold for arbitrary smooth fibre bundles, so we will restrict 
ourselves to those bundles that carry a sufficiently rigid structure on their fibres. 

7.5.1 Structured Fibre Bundles 

We start by making this statement more precise. 

Definition 7.7 Let C be a subcategory of the category of smooth manifolds such 
that the group of automorphisms of each object of C is a finite dimensional Lie 
group. Then a 'structured fibre bundle' with structure C and fibre Fq G ob(C) 
is by definition a smooth fibre bundle tt : F — >■ M where the fibres are objects 
of C. We also require each point to possess a neighbourhood U and a local 
trivialization (f) : Tr~^{U) X U such that (p restricted to a single fibre is a 

C-isomorphism 7r~^(m) — > Fq. 

For example, a structured fibre bundle in the category of finite dimensional 
vector spaces is a vector bundle. 

If TT : F ^ M is any smooth fibre bundle, then an automorphism of it is by 
definition a difi^eomorphism q of F such that 7r(/) = n{f') implies 7r(a(/)) = 
7T{a{f')). It is called vertical if it maps each fibre to itself. 

Definition 7.8 We define an automorphism of a structured fibre bundle to be 
an automorphism of the smooth fibre bundle such that its restriction to each 
single fibre is an isomorphism in C. 

One can then construct a sequence of groups 

1 Aut^(F)^ Autf (F) Difff (M) 1 (7.11) 

and its corresponding exact sequence of Lie algebras 

^ T^{TF)^ T^{TF)'" Vc{TM) -> , (7.12) 

where V is for vertical, 'P' for projectable, and c again stands for '0 outside a 
compact subset of M\ 

The proof of the following corollary of theorem 7.14 is now a formality. 

Corollary 7.17 Let n : F ^ M be a structured fibre bundle with fibre Fq such 
that (7.12) splits as a sequence of Lie algebras. Then there exists an action p 
of G{k, M) by C- automorphisms on a single fibre F^ such that 

F = F+''{M) XpFo. 

Proof. Construct the principal Aut*^(-Fo)-bundlc n : P ^ M, the fibre over x 
of which is precisely the set of C-isomorphisms cp '■ ^ F^. Then there is a 
natural isomorphism Aut^(_F) ~ Autc(P)'^ under which the vertical subgroups 
of the two correspond, so that the exact sequence (7.11) is isomorphic to (7.1), 
and therefore (7. 12) to (7.2) . As F = P x Aut(_F„) -fro) we can now apply theorem 
7.14 to P in order to substantiate our claim. □ 
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7.5.2 Vector Bundles 

We specialize to the case of vector bundles. These are precisely structured fibre 
bundles in the category of finite dimensional vector spaces. 

The exact sequence of Lie algebras (7.12) for a vector bundle E with fibre 
V is then 

1 ^ D0°(£) ^ BOl{E) ^ Tc{TM) , (7.13) 

where DOl{E) is the Lie algebra of compactly supported order differential 
operators on T{E), and DO"(i?) the ideal of O"^ order ones, that is to say 
DO°(S) ~ Tc{E(g)E*). 

Corollary 7.17 then says that (7.13) splits as a sequence of Lie algebras if and 
only if there is a representation p of G{k, M) on V such that E ~ F"'"'^(M) XpV . 

But thanks to the fact that all finite dimensional representations of the 
universal cover of GL+(]R") factor through GL"'"(R") itself, we can even say 
something slightly stronger. 

Proposition 7.18 Let E ^ M be a vector bundle for which (7.13) splits as 
a sequence of Lie algebras. Then there exists a representation p of the group 
{G^ X 7ri(M))^''„j on V such that 

E:^{G''{M)xn,{M)f^^^x,V. 

Remark If M is orientable, this reads E ~ 7r*F+'^(M) XpV. In this expres- 
sion, 7r*F+'^(Af) is the puUback of i^+'^(M) along tt : M — )• M, considered as a 
principal G^oiR"^) X 7ri(M)-bundle over M. 

Proof. Consider the restriction of the map t in equation (7.7) to the group 
G+^„(Af). In order to prove the proposition, we need but show that its kernel 

Z acts trivially on V. For fc = 0, this is clear. 

If k is at least 1, the homomorphism GL+(M") G^^^{M) makes V into 

a finite dimensional representation space for GL+(]R"). But it is known (see 
[Kna96, p. 311]) that all finite dimensional representations of its cover factor 

through GL'''(IR") itself. This implies that the subgroup Z which covers the 
identity must act trivially on V, and we may consider 

GtXiM)/Z {G\M) X n^{M))l^^^ 

to be the underlying bundle, as announced. □ 

This reduces the problem of classifying vector bundles with split sequence (7.13) 
to the representation theory of (G'^ x tti{M))^ ^. 

The above extends a result [Ter78] of Terng, in which she classifies vector 
bundles which allow for a local splitting of the sequence of groups (7.11). It is 
an extension first of all in the sense that we prove, rather than assume, that the 
splitting is local. Secondly, wc have shown that in classifying vector bundles with 
split sequence (7.13) of Lie algebras rather than groups, one encounters only 
slightly more. Intuitively speaking, the extra bit is the representation theory 
of 7ri(M). We refer to [Ter78] for a thorough exposition of the representation 
theory of Gg o(M"). 
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7.6 Flat Connections 

Having concluded our classification of bundles in which (7.2) is split as a se- 
quence of Lie algebras, the time has come to apply our newly acquired knowl- 
edge. 

In this section, we will investigate splittings that come from a flat equivariant 
connection on a principal G-bundle P — > M. We will prove that if the Lie 
algebra g of G does not contain s[(M") as a subalgebra, then the sequence of 
Lie algebras (7.2) splits if and only if P admits a flat equivariant connection. 
In other words, the sequence (7.2) then splits as a sequence of Lie algebras if 
and only if it splits as a sequence of Lie algebras and C°°(M)-modules. 

Note that this is certainly not the case for general groups G. The frame 
bundle for example always allows for a splitting of (7.2), but usually not for a 
flat connection. 

7.6.1 Lie Algebras That Do Not Contain si{W") 

Although lemma 7.6 exhibits cr as a differential operator of finite order, the 
bound on the order is certainly not optimal. With full knowledge of the Lie 
algebras at hand, sharper restrictions can be put on the kernel of a. In partic- 
ular, if Q does not contain 5[(M"), there is only a single relevant ideal, and a is 
of order at most 1. For notation, see section 7.2.2. 

Lemma 7.19 Let n = 1, and let g be such that it does not contain two nonzero 
elements such that [X,Y] = Y. Or let n > 2, and let g be such that it does 
not admit sl(]R") as a subalgebra. Then the kernel of the homomorphism fm '■ 
Vec„ — >■ contains {v G Vec„ | Divm(w) = 0}. 

Proof. We start with the case n = 1. Again, we note that the only ideals of 
Veci = Spanjx'^Sa; | A: > 1} are Spa.nlx^d^jx'^dx | fc > 4}, and for each A'' > 1 an 
ideal Span{x''dx \ k > N}. The corresponding quotients all contain two elements 
X and Y with [AT, y] = Y, except the ideals corresponding to A^ = 1,2. This 
means that also q, containing Veci/ker(/m) as the image of fm, will possess 
X and Y such that [X, Y] = Y unless the kernel of fm contains the ideal 
Spanix'^dx \k>2} = {v & Veci | Div„(i;) = 0}. 

Now for n > 2. Under the identification Vcc° ~ 0[(M") given by Xidj i->- e^, 
the Euler vector field is the identity 1 and Div^ becomes the trace. As ker(/m)° 
is an ideal in b((M"), it can be either 0, Ml, s((E") or Rl 51(1^"). In the 
former two cases, lm.{fm) — Vec„/ ker(/TO), and hence Q, would contain 5[(M") 
as a subalgebra, contradicting the hypothesis. Hence s((]R") C ker(/m). If we 
now show that [Vec„.s[(M")] = s[(M") Vcc^, the proof will be complete. 

Let i ^ j. We then have [xidj,XjX"dj] = {aj + l)xiX°'dj, showing that 
XiX^dj e [Vcc„, s[(M")]. The only basis elements not of this shape are of the 
form Xjdj. But [xjdi,Xix'^~^dj] = x'^dj — XiX^^^di. If fc > 2, the latter part 
was just shown to be in [Vec„, s[(M")], so that also x'^dj e [Vec„,sl(R")]. If 
k = 1, the elements xjdj — Xidi join Xidj to form a basis of sl(M"). □ 
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This rather limits the possibilities. Not only can we restrict to first order, but 

also the Lie algebroid map V : J^{TM) — )■ TP/G vanishes on the trace-zero jets 
= {jLi'>j) e J^{TM) I v{m) = and tr(t>) = 0}, so that it factors through 
the 'trace Lie algebroid' tr^(M) := J^{TM)/Km. 

This in turn is the Lie algebroid of the 'determinant groupoid' Det(M). An 
element [a]m',m of Det{M)jn' ,m is by definition an equivalence class of diffeo- 
morphisms mapping m to to', with a ~ /3 if and only if Det(/3~^a) = 1. 

As [a].m',m identifies A"(T;,M) with A"(T4,M), the source fibre Det(M)*,m 
is isomorphic to the determinant line bundle A"(r*M) — >■ M. Its^ connected 
component A" +(r*-M) is the the bundle of positive top forms if M is orientable, 
and the whole bundle otherwise. 

Its universal covering space is the bundle A"'+(T*M) of positive top forms 
on M. Indeed, M is always orientable, regardless of whether or not M is. This 
means that A"(T*M) is a trivial bundle, and that its connected component 
/\"'+(T* M) ~ M X M+ is simply connected. The covering map is induced by 
the map M — )■ M. This leads to the following version of theorem 7.14. 

Proposition 7.20 Let P be a principal G-bundle over an n-dimensional man- 
ifold M. Let G be such that its Lie algebra g does not contain s[(M") if n > 1, 

or [X, Y] = Y if n = 1. Then there is a homomorphism tti (M) x R+ — ^ G 
associating P to the principal ■ni{M) x -bundle A"'+(T*M) M. 

p^A"'+(r*M)x,^(M)xM+G. 

We may even classify the possible splittings. 

Corollary 7.21 Under the hypotheses of proposition 7.20, any Lie-algebra ho- 
momorphism (J : TciTM) Tc{TP)'^ which splits the sequence of Lie algebras 
(7.2) can be written 

a = V''+ADiv^, (7.14) 

where is a fiat equivariant connection on P, and A a section of ad{P) which 
is constant w.r.t. the connection induced on ad(P) by V. 

Remark In particular, this shows that there exists a flat connection which 
splits (7.2), even though most splittings are not flat connections. 

Proof. First, we prove the case P — A"'"'"(T*M). Pick a nonzero (pseudo-) 
density fi on M. This induces an honest density /}, on M, which in turn identifies 
A"'+(T*M) with M xR+. The local trivializations of A"'+(T*M) M and 
M M combine to locally trivialize A"'+(r*M) M. This yields a flat 
equivariant connection on A"'+(r*M), which annihilates /i. 

The splitting a is uniquely determined by the action of (t{v) on local sections 
which reads a{v){i/) = tt* o £^ o 7r*^^i>, where tt is the map from M to M. 
If we define the divergence w.r.t. /i by the requirement that the Lie derivative 

■*An isomorphism A"(T*M) ~ Dct(]\I),^m is only given after a choice of Ao G A"(T^M). 
This determines the connected component. 
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Cyfi equal Div^(u)/x, then we have 

a{v)Uti) = TT* 

= v{f)il + DiY^{v)fjl 
= V!£{f(l) + Biy^{v)fil. 

This shows that a{v) = + ADivi^{v), with A = d^, the equivariant vertical 
vector field defined by the action of on A"(r*M). (Equivariant vertical 
vector fields on P correspond to sections of ad(P).) 

The general case follows by proposition 7.20. □ 



7.6.2 Lie Algebra Cohomology 

If we specialize to the case of a trivial bundle over an Abelian group G, we find 
ourselves in the realm of Lie algebra colioniology. The continuous cohomology 
of the Lie algebra of vector fields with values in the functions has already been 
unravelled in all degrees [Fuk86]. Corollary 7.21 describes this cohomology only 
in degree 1, but now with all cocycles rather than just the continuous ones. 

Corollary 7.22 Let Hla denote Lie algebra cohomology and HdR de Rham 

cohomology. Let g be Abelian, and consider the representation C^{M,q) of 
Tc{TM) where a vector field v acts by the Lie derivative Cy. Then 

Hi^{T,{TM), Cr(M,fl)) ~ HUM,Q) e . 

Proof. Consider the trivial bundle M x G — >■ M over an Abelian Lie group 
G, which comes equipped with a flat connection V°, which acts as Lie deriva- 
tive. Note that Abelian g certainly satisfy the conditions of propositions 7.20 
and 7.21. View rc(ad(P)) ~ C^{M,q) as a representation of T^{TM), and 
consider its Lie algebra cohomology. An ?>cochain is an alternating linear map 
rc(TM)" ^ G^{M,q). For e C\ closure 6f^ = amounts to 

Cyf\w)-C^f{v)-f\[v,w])=(^- 

Due to this cocycle condition, a = + is once again a Lie algebra homo- 
morphism splitting tt*. According to corollary 7.21, it must therefore take the 
shape (T = + ADiv^,, where A G g is constant. One can write V'^ = + 
for some closed 1-forni uj^ , so that f^ — u!^ + ADiv^. This classifies the closed 
1-cocycles. 

Exact 1-cocycles satisfy f^{v) = Sf^iv) = Cyf° = df°{v), with f° a 0- 
cocycle, that is an element of C^{M, g). 

Note that a change of density fi' = eJ' fi, alters f^ by a mere coboundary Adh, 
so that the choice of ^ is immaterial. The class of + ADiVj^ modulo is 
therefore determined by [ui^] e H^j^{M,g) and A e g. □ 
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Continuity turns out to be implied by the closedness-condition. A similar situ- 
ation was encountered by Takens in [Tak73] , when proving that all derivations 
of T^iTM) are inner, i.e. Hl^{rc{TM),T^{TM)) = 0. 

7.7 General relativity, Gauge theory, and Spinors 

In this section, we briefly reflect on the relationship between (generalized) spin 
structures and principal fibre bundles with a sphtting of (7.2). 

We formulate general relativity and gauge theory, including the fermionic 
fields, in terms of principal fibre bundles over the manifold M which describes 
space-time. For convenience, we will take M to be a smooth and orientable 
manifold of dimension at least 2. 

7.7.1 General Relativity 

The fundamental degrees of freedom in general relativity are a pseudo-Riemann- 
ian metric g on space-time M, and a connection V on TM. 

We identify the metric with a section of F+(A'/)/SO(t/) by associating to 
: T^M x T^M M the coset of all frames / : M" ^ T^M such that J*g = i]. 
We identify the connection on TM with an equivariant connection on F^{M), 
that is a section of ji(F+(M))/GL+(M") M. (Its value at x is the class of a 
1-jet at X of a section S r(F+) with Wx4' — [Sar09].) These two, the metric 
g and the connection V, are conveniently combined into a single section of the 
fibre bundle J^{F+{M))/SO{r)). 

The dynamics of the theory are then governed by the Einstein-Hilbert action 
Seh ■■ T{J^{F+{M))/SO{r])) R, defined in terms of the Ricci scalar R by 



In the Einstein-Hilbert approach, the connection is constrained to equal the 
Levi-Civita connection, and only the metric is varied. In the Palatini approach 
[Ash91], the connection varies independently, and the fact that V is the Levi- 
Civita connection of the metric is a consequence of the field equations. 

Either way, it is clear that the fields transform in a natural fashion under 
diffeomorphisms of M. Any orientation-preserving diffeomorphism a of M lifts 
to an automorphism T,{a) of {F+{M))/ SO {r]), defined by S (a) (j^ ((/))) = 
j^{ct^ o(l)oa~^). This splits the sequence of groups 



1 Aut^(ji(F+(M))/SO(r?))) ^ Axit{j\F+ {M))/S0{7])) DiS+{M) 1 . 



This splitting is central to the theory of general relativity. The requirement 
that the action be invariant under co-ordinate transformations, SEHi'^ict)4>) = 
SehW, cannot even be formulated without providing S explicitly. 

Note that as the above sequence of groups splits, so does the corresponding 
sequence of Lie algebras of vector fields. 
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7.7.2 Fermions and Spin Structures 

We wish to describe fermions. As these are known to transform under Lorentz 
transformations by a projective representation rather than a linear one, we must 
extend our framework. 

The Bundle 

Suppose that we have a spin structure Q w.r.t a background metric g. Let u : 
Q -^F^{M) be as in section 7.4.3, and let F be a unitary spinor representation^ 
of SO{ri). Then one has the composite bundle 

A section t : M ^ Q x ^^^^-^V can then be interpreted as a metric g along with 

a fermionic field ijj. Consider g as the induced section of F^{M)/SO{ri). Use g 
to construct the spinor bundle u~^{OF^) Xso(^) ^) ^^'^ obtain a section tp by 
simply restricting the image of r. 

In the same vein, wc will describe physical fields by sections of the fibre 
bundle J^{Q)xgQ{ri)V. This is equivalent to providing a triple of sections: one 
of F+ (M) /SO(r?), one of (F+ (M))/GL+ (K") , and one of u"! (0F+ ) x gQ^^^ V. 
These correspond to the metric ^^y, the Levi Civita-connection F^^, and the 
fermionic field -0' respectively. 

Transformation Behaviour 

Let us investigate its transformation behaviour. As a spinor changes sign un- 
der a 27r-rotation, there is no hope of finding an interesting homomorphism 
of groups DiS~^{M) Aut{J^{Q) Xso(77) There is however a canonical 
homomorphism of Lie algebras. 

Because u : Q ^ F+ (M) has discrete fibres, it has a unique flat equivariant 
connection V. This means that the exact sequence of Lie algebras 

re(ad(Q)) ^ rc(TQ)^"'(*'') Tc{TM) 

is split by CT := Vo£), with D : TdTM) r^CTf +(M))C^l+(k") the first order 
derivative D{v)f = 9/9f|o exp(to)« o /. This induces a splitting for J^{Q) by 
prolongation (see e.g. [FR04]), and consequently also one for J^{Q) ^s6{ri) ^" 
Note that o cr equals D. 

We see that Q is an infinitesimally natural bundle, and that the canonical 
splitting a of (7.2) does not come from a splitting of groups. We would like to 
emphasize that even if a splitting at the level of groups does exist, it will not 
be physically relevant, because it cannot reproduce the minus sign under a full 
rotation that one knows and loves in fermions. 

^The indefinite article is appropriate since there is a choice involved here. The connected 
component of 1 of SO(3, 1) is Spin''"(3, 1) ~ SL^(C). A spinor representation for the connected 
component can then be unambiguously derived from a Clifford algebra representation [Her74] . 
But as SO(3, 1) is not isomorphic to Spin(3, 1), the action of the order 2 central elements 
covering PT will have to be specified 'by hand'. Again, this is not relevant if M is time- 
orientable as well as orientable. 
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Take for example the spin structure Q = M" x 80(77) over M", and lift 
a € Diff+(M) to Aut((5) by Y,{a){m,q) = {a{m),q). This the wrong thing 
to do: if we restrict attention to SO (77) < Diff^(M), we see that sections of 
Q Xgo(,,) ^ transform under the trivial representation of the Lorentz group. 
Our fcrmions arc Lorentz scalars rather than spin- 1/2 particles. This is a very 
real problem: using the wrong splitting will generally result in an incorrect 
energy- momentum tensor [GM92]. 

We conclude that not only the bundle Q and the covering map u : Q ^ OFg 

are relevant, but also the splitting a : TdTM) -> Tc{TQ)^^^ ^^"l It must 
satisfy u* o cr = D in order for the metric g € T{TF^ {M) / SO{r])) to transform 
properly. Luckily, such a ct is naturally associated to any ordinary spin structure 
Q. For generalized spin structures however, this is no longer so. 

7.7.3 Gauge Fields and Spin*^-structures 

In the presence of gauge fields, the topological conditions on M in order to 
support a spin structure are more relaxed. Intuitively, this is because the gauge 
group G can absorb some of the indeterminacy that stems from the 2:1 cover 
of the Lorentz group. 

Generalized Spin Structures 

This is made more rigorous by the notion of a generalized spin structure or 
SpinG-structure [AI80]. 

Definition 7.9 ^et G be a Lie group with a central subgroup isomorphic to Z. 
Le^ SpinG — SO{r]) Xz G. A SpinG-structure is a SpinG-bundle Q over M, 
together with a map u : Q ^ ^^^^ makes 

SpinG SO(r?) 



Q 




commute. We again denote the map {x,g) h- > k{x) by k. 

This gives rise to the principal GL(]R") G-bundle Q := Q Xg^^^^GL{R"). If 
G = Z, we recover the notion of a spin structure. Apart from spin structures, 
the best known examples of SpinG-structures are probably the spin'^-structures, 
which are precisely SpinG-structures for G ~ U{1). 

Let y be a representation of SpinG. The 'physical bundle' is then the fibre 
J^{Q) XgpinG V. A single section of J^{Q) Xgpj^G V represents a metric, a Levi 
Civita-connection, a gauge field and a fermionic field. 

^This notation is convenient but slightly misleading. Be ware that if 77 is of signature 
-|- -|- H — , then Spin^ is isomorphic to SO(?7), not to Spin(3, 1). 
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The metric is the induced section of F~^{M)/SO{ri), and the the Levi-Civita 
connection that ctf ji(F+(M))/GL+(M"). One constructs the principal G/Z- 
bundle P Q/GL+(]R"), and the gauge field is the induced equivariant con- 
nection on P, a section of J^(P)/{G/Z). The fermionic field is the induced 
section of 7r^^(OF+) Xso(,,)xzG ^' wli^'^s one should note that the bundle it- 
self depends on g. 

Infinitesimally Natural Spin^-Structures 

We argue that it only makes sense to consider SpinG-structures which admit an 
appropriate transformation law under infinitesimal diS^eomorphisms of space- 
time. We will call these SpinG-structures infinitesimally natural. 

Definition 7.10 A SpinG structure u: Q ^ FOg will he called 'infinitesimally 
natural' if Q ^ M is infinitesimally natural as a principal fibre bundle. More- 
over, we require that the splitting a : Tc{TM) T^iTQ)^^^^"^''^'^ of (7.2) 
satisfy u* o a = D. 

The mathematical requirement that Q — > M be infinitesimally natural as a 
principal fibre bundle corresponds to the physical reqTiirement that fields should 
have a definite transformation behaviour under infinitesimal co-ordinate trans- 
formations. 

The requirement u^oa = D corresponds to the fact that we need to interpret 
a section of (5/GL(M") G ~ F+(M)/S0(?7) as a metric, and we know that 
its transformation behaviour is governed by D. 

We view infinitesimally natural SpinG-structures as the underlying principal 
fibre bundles in any classical field theory combining gravity, fermions and gauge 
fields. Let us work towards their classification. 

Classification of Infinitesimally Natural Spin'^-Structures 

Theorem 7. 14 is of course the main tool when classifying infinitesinially natural 
SpinG-structures. It provides a homomorphism p : G{k,M) — > GL(M") Xz G 
such that Q is isomorphic to F+''{M) Xp (GL(R") Xz G), and a map exp V : 
F^'^{M) — >■ Q. The splitting <t is induced by the lift of the k*"^ order deriva- 
tive i) : T{TM) -i> T{TF+''{M))'^<^'''^\ In summary, we have the following 
commutative diagram. 

D 

re(rF+'=(M))?^'=''" 

exp V* I 
— > Tc{TF{M))^^^^"^ 



exp V* I ^ 

re(Ti?^(M))'2L(M'') 




T^{TM) — > 
id 



re(TM) 



id 



re(TM) 
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The classification theorem for infinitesimally natural SpinG-structures will 
take the following form. 

Theorem 7.23 Let M be an orientable smooth manifold of dimension n > 2. 
Let G be a Lie group with a central subgroup Z isomorphic to 7ri(GL(M")), and 
let Lie(G) be such that it does not contain 5l(M"'). Finally, let {Q,u) be an 
infinitesimally natural SpinG -structure over M. Then ia, : Z ^ tti{F~^ (M)) is 
injective, and there exists a homoraorphism r : tti{F'^[M)) — >■ G which fixes Z, 
and makes {Q, u) isomorphic to the bundle 

OF^g XrG, 

with u the natural projection map onto OF^. 

Remark We have already seen that an ordinary spin structure is always 
infinitesimally natural, so that proposition 7.16 is just the special case G = Z. 

Theorem 7.23 rather simplifies the data needed to construct the relevant bundle 
J^{Q) XgpjjjG V. Indeed, it suffices to have a triple (M, G, [p, V)) of: 

- An orientable manifold M such that : 7ri(SO(?7)) — )• t:i{F{M)) is injec- 
tive. Its image Z = {±1} is then central in 7ri(F(M)). 

- A representation p of S0(?7) Xz 'K\{F{M)) on V which is unitary when 
restricted to ■ki{F{M)), and faithful on Z. 

- A subgroup G < U{V) that commutes with the image of SO(ry) under p, 
and contains the image of 7ri(f (M)). 

We can then form Q = F+{M) ^Tri{F) G, with structure group SpinG = 
S0(7?) XzG. The relevant bundle is J^Q) XgpjjjG V, a single section providing 
the metric, Levi-Civita connection, gauge field and fermions. 

Note that the 'gauge bundle' is the trivial bundle P = M x G/Z, and that 
the bundle describing fermions and metric is F^ (M) XhV, where H is defined 
as H := SO{r]) XzMF)- 

According to theorem 7.23, the above setting exhausts the possibilities at 
least under the (natural) assumptions that Q is infinitesimally natural, and that 
y is a faithful unitary representation for G. Indeed, G must then be compact, 
so that its Lie-algebra cannot allow 5l(M") as a subalgebra. 

Two Lemmas 

We set out to prove theorem 7.23. We start with two lemmas designed to 
explicate the homomorphism p : G{k, M) GL(M") XzG. 

Lemma 7.24 Denote the natural projection map F^^{M) — > F+(M) by n, 
and write v : G{k,M) GL+(M") for the corresponding homomorphism of 
groups. Then each infinitesimally natural SpinG -structure is isomorphic to one 
for which ko p ^ u and u o exp V = tt. 
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Proof. Denote uoexpV by 7. Pick / e F+'=(M), and let 7r(/) = /. There 
exists a c G GL~''(K") such that ■y{f) = fc. Equivariance of 7 then implies 
lify) = fcK{p{y)) for^all y G G(fc, M). 

We required 7* o D = D, but clearly we also have n^, o D = D. Therefore, 
if exp(_D(w))/ = fy and exp(D(u))/ = fy, we must have both i'{y) = y and 
f^ipiy)) = c-^yc. 

We may obtain all y in G{k,M)^, the connected component of G{k,M), 
by choosing an appropriate u, so that we have K{p{y)) = c~^v{y)c for all y € 
G{k, M f' . Although c depends on / a priori, it turns out to be constant up 
to scaling. Indeed, as both k and p are constant, so is the adjoint action 
of c. Moreover, / 1— > c(/) is invariant imder G{k,M)^ , making it a function 
on F+''{M)/G{k,M)° ~ M. All in all, we have established that there exist 
Co € SL(R") and h : M ^ M+ such that c(/) = coh{x), with i the projection 
of / to M. We may write u o exp V(/) = Tr{f)cQh{x). 

We show that we may as well take c and to be Pick a c G SL(M") which 
covers c, and construct the bundle Qc ■= Q XAd(c) GL(]R"). It is isomorphic to 
Q, with isomorphism Qc ^ Q given by [q, y] h- > qc~^h~^{x)y. 

If we simply pull back the covering map on Q, we obtain Uc'- Qc^ F^{M) 
given by {q,y\ ^ u{qc~^h~^{x)y). This makes Qc ■= u~^{OFg) into a SpinG- 
structure isomorphic to Q, but with the desired properties. □ 

Recall from (7.9) that G{k,M) ~ G{k,W^) 7ri(F+(M)), and that we have 
G{k,W) ~ GL+(M") K G>\ with G>^ the subgroup of A:-jets that are the 
identity to first order. We unravel p, considering it as a map 

SL(M") xM+ kG>i X2 7ri(i^+(Af)) ^ SL(M") xM+ x^G. 

Lemma 7.25 Under the assumptions of theorem 7.23, the map p is completely 
determined by a homomorphism p\ : 7ri(F+(M)) G fixing Z, a homomor- 
phism p2 : 7ri(F+(M)) 1R+, and an element A 0/ Lie(G) which commutes 
with Im(pi). We have 

p: {x,e\g,\p]) ^ (x, e*p2([p]), e*V(b])) • 

Proof. Consider p as a Lie algebra homomorphism from sl(R"') x R k to 
5[(M") x M X Lie(G), and let pij be its {i,j) component for i,j e {1,2,3}. 

Because s[(]R") is simple and not contained in Lie(G), we must have pi^ = 0. 
Due to the previous lemma, pi2 = and pn = id. As SL(IR") is simply 
connected, we must have (i, 1, 1, 1) (i, 1,1). In particular, this forces the 
map : Z — )• 7ri(F+(M)) to be injective. 

Again due to the previous lemma, p2i — and P22 = id. Define the 'scaling 
element' A := P23(l)- Then (1, e*, 1, 1) ^ (1, e*, e*^). The image of 7Ti{F+{M)) 
must commute with (x, e*, e*"^), so that p : 7ri(F+(M)) — >■ M"*" xG is well defined, 
and A commutes with its image. 

We now show that p(fl>^) = 0. First of all, as [sl(M"),s[(M") + g>'^] equals 
5[(K") + g^^ if n is at least 2 (see Lemma 7.19), we must have 

p(b>') C [p(s[(K")), /9(st(M") + 0>i)] c st(M") ® © . 



182 



Chapter 7 



But on the other hand, as [M,f|-^^] = g-^^, (recall that K represents the Euler 
vector field), we have 

= [pW. p{&^')] c e e Lie(G) . 

The intersection being zero, we have = 0. But then p{G^^) = 1, because 

G^^ is simply connected. □ 



Proof of Theorem 7.23 

Clearly, M+ cannot cause any topological obstruction, so we should be able to 
eliminate both p2 and A from the story. The former is easy, recall that we seek 
Q, not Q. As Q is a subbundle of Q/M"*", we will focus on the latter from now 
on, allowing us to simply disregard p2- 

In order to remove A, we choose a volume form A on Ad . This endows each 
frame / € Fx{M) with a volume vo1a(/). A frame has volume 1 precisely 
when it is the jet of a diffeomorphism which preserves A. Denote by F^{M) 
the SL(IR")-bimdlc of frames with volume 1. Its universal cover F^{M) is its 
inverse image under tt, a principal SL(]R") Xz 7ri(F+(M))-bundle. 

Define the isomorphism 

Q^FxiM) x,^(^+(M)) (GxM+) 

by 

if, 9) ^ (/vol^i(7r(/)), exp(log(voU(7r(/)))A)5,voU(7r(/))) 

where we consider Q as (F^''{M) Xp^^^ G). One can see that it is well defined, 
and that it intertwines the natural maps to F/M.'^. 

This shows that Q/M.'^, and therefore the spin structure {Q, u), is completely 
determined by the homomorphism pi : tti{F~^{M)) G. We denote it by r 
from now on. 

If we choose A to be the volume form induced by the metric g, tlien^OF+ is 

a subbundle of Fx (M) . Since tti {F+ (M))^ tti {OF+), we have Q - OFj x^G. 
The spin map u is simply the projection O-F^ — >■ OF^ . The principal G- bundle 

Q — )■ OFg has a flat equivariant connection induced by the one on OF+ — >• OF^. 
This concludes the proof of theorem 7.23. □ 

Tracking back through the isomorphisms, we can formulate the following. 

Corollary 7.26 Under the assumptions of theorem 7.23, there exists an ele- 
ment A e Lie(G) which comm,utes with the image of t, such that the splitting 
a : T{TM) r(rQ/M+)SL(R")xzG ^^^g„ 

a{v) = V o D{v) + DivA(u)A , 

where A is the volume form induced by g, D is the natural lift from T{TM) to 
r(F+(M))/]R+), and we have identified A with the vector field on Q/M.'^ induced 
by the action of the Lie algebra element. 

It is clear that two different homomorphisms Ti and T2 yield isomorphic SpinG- 
structures if one is obtained from the other by conjugation within G. 
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7.7.4 Induced Unitary Representations of Diff°(M) 

Before we explore the ramifications of theorem 7.23 on some theories of physical 

interest, wc pause for a second to examine the representations of Diff"(M) that 
Uve on infinitesimally natural bundles of the type described on page 180. 

Recall that a metric is precisely a section of the bundle of symmetric spaces 
77 : R{M) M, with R{M) := F+{M)/SO{ri). The bundle describing fermions 
and metric is E{V) := F^{M) Xgpj^z V, with V a representation of Spin^ = 
S0(?7) Xz 'Ki{F{M)) of spinor type. The projection EiV) R{M) makes it 
into a vector bundle over R,{M), and the projection E{V) M makes it into 
a fibre bundle, unfortunately not a vector bundle, over M. 

If we fix M to be M", and g to be the Minkowski metric rj, then the action of 
the connected Poincare group S0''(r7) k M" on OF+(M) induces an action (by 
automorphisms) of its cover on OF^{M). This in_turn induces a representation 
on the sections of the associated vector bundle OF+(M)XgQ(,^^jF, the space of 
spinors. The Minkowski metric gives rise to a unique (up to a scalar) Poincare- 
invariant volume form vol^ on M, which we use to turn the nondegenerate 
Lorentz-invariant form on V into the nondegenerate Poincare-invariant form 
{ip I ^y) = Jj^ji'^l-', '>l-'')vcAri on the space of sections. The group action commutes 
with the Dirac operator, and the representation therefore decomposes according 
to mass. 

We would like to do something similar for the diffeomorphism group instead 
of the Poincare group. The action of DifF'^(M) on F'^{M) induces an action by 
automorphisms of Diff°(M) on F+{M), and therefore also on R{M) and E{V). 
Unfortunately, the sections r{E{V) M) of physical relevance do not form a 
representation, or even a vector space for that matter. 

We do however have a representation of Difr°(M) on r{E{V) R{M)). We 
will show that R{M) carries a unique (up to a scalar) Diff°(M)-invariant volume 
form^ vol, so that r{E{V) — > R(M)) has a nondegenerate Diff'^(M)-invaj:iant 
form {(t>\i^)= /fl(M)(0' V')vol. 

An Invariant Volume 

The only special asset of SO(r/) that we will use is the fact that it is an open 
subgroup of the fixed point set of the involution g ^ ri{g*)~^r] of SL(M"), so we 
may as well work in that setting. 

Proposition 7.27 Let M be a connected n- dimensional smooth manifold, and 
let H < SL(E") be an open subgroup of the fixed point set of an involution. Then 
the bundle R{M) = F^{M)/H of symmetric spaces admits a diffeomorphism- 
invariant volume form vol. R is everywhere nonzero, and unique up toM.^ . 

Proof We write f) for the Lie algebra of H. The decomposition f|l(M") = f) © m 
of 0[(M") into a positive and a negative eigenspace of the Lie algebra involution 

^It would be tempting to interpret the volume form {tl>, i/')vol on R(M) as the probability 
density of observing a ferinion at a certain point of space-time and with a certain value of the 
metric, but as far as 1 am aware, this is not what physicists do. 
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is invariant under the adjoint action of f). In particular, m is an [)-representation. 
We construct a canonical map 

A"(M"*) a'=(0[(M")*)'' 4 0"+'=(ii(M))°^*f°(^\ 

with k = dim(m). Choose A e A"(M"*) and n e A'=(0l(M")*)''. The former 
is a volume form on M", the latter can be thought of as a volume form on 
0[(M")/f) ~ m. Wc construct the volume form on R{M) over each point [/] S 
R{M) separately, using the isomorphism TfF^{M)/if{k}) ^ T[fjR{M) induced 
by the quotient map F+(M) R{M). 

Each tangent space TfF~^{M) fits into the exact sequence of vector spaces 

— > ^ TfF+{M) M" ^ , 

where ■— ^|o/ ° R*^- and Of = o tt*. 

The n-form 0jA annihilates the image oiif, so in particular, it defines an 
n-form on on T[j:]i?(M). This is in fact independent of the choice of / represent- 
ing [/]. The two identifications T[f]R{M) ~ TfF+{M)/ij{\)) and T[/]i?(M) ~ 
TfhF'^{M)/ifh{\)) are intertwined by Rh*, the pushforward of right multipli- 
cation by h. We have Oft o R^* = /i~^/~^7r*i?/i« = h~^f~^Tr» = o Of, so 
that {Rh^yOf^X = 0*h-^*X. Because h-^*X = X, (h e H < SL(M") is volume 
preserving), we even have {Rh*)*0*jf^X = OjX. Since the two forms 0*fX and 0*ff^X 
are related hy Rh*, they define the same n-form on T[f]R{M). 

In order to transport /i to TfF^(M), we need to choose a splitting Sf : 
TfF+{M) 0l(M") such that Sf oif = id. Consider the n + k-iorm s*n A 6*} A. 
Although s*fiJ, depends on the choice of splitting, the form Sj/i A 0*fX does not. 
Indeed, since (sf — Sj) o if = for every two sections Sf and s'f, we have 
i*j:{s*j,fj, — s'f n) = 0, which implies that (s^^ — ^'f f^) ^}'^ ^ ^■ 

It is also independent of the choice of /. Since Rh* ° ifiO = 3^|o/e*^/i = 
^\ofh{h~^e*^h) = ifh o Ad/j-i(^), we obtain Sf = Ad^ o Sfh o Rh^, so that 
s*fH = {Rh^y s*f f^Adh*^. 

Note that Adh respects the nondegenerate bilinear form k{^, x) = tr(^x) on 
gl(IR"), and hence its restriction to m. Since Adh : m — )• m is orthogonal w.r.t. 
K, it certainly preserves the volume form fi. 

Now since Ad^^n = fi, we have s*ffi = [Rt*)* s*ff^ ji, as required. This means 
that vol[y] = Sjfi A 0*j.X is a well-defined (n + A:)-form on T[f^R{M) for any [/]. 
Since our construction varies smoothly with [/], they combine into a smooth 
volume form vol on R{M), which clearly depends only on A (g) /i. 

A diffeomorphism a G DifF''(M) induces an automorphism a* of i^+(M) 
by pushforward, and therefore also one of R{M). The diffeomorphism invari- 
ance of the volume form is a straightforward consequence of the diffeomor- 
phism invariance of / h- > Of and / h^- if. Indeed, invariance of the volume, 
(a*)*vol = vol, is equivalent to (a**)*(s*^j/Lt A 0*^j:A) = s*f^/\0*fX. This is true 
because Oa,f o a** = Of and Sa,f o a„ = Sf. The first can be seen by expand- 
ing the definitions and using o tt o a* = n, the second is a consequence of 
a** o ifiO = i^a*/e*^ = ia,f{0- 
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If M is connected, then DifF (M) acts transitively on R{M), so that the in- 
variant volume is determined by its value in a single point. It is therefore unique 
up to a real scalar. The fact that it is nonzero is clear from the construction. □ 

Remark The space R{M) has more structure than just a canonical volume. 
In fact, its frame bundle has a canonical Diff°(M)-invariant subbundle with 

structure group H K Lin(]R",Tn), with multiplication given by {h,v){h' ,v') = 
{hh' , Adh o v' + V o h'). This implies the existence of an invariant volume, 
because the structure group is contained in SL(]R" x m). 

Now let H be the preimage of H under G{1, M) — )• GL"'"(]R"'), an extension of 
H by 7ri(i^(Af)). (Cf. dcf. 7.6.) If is a representation of H, we construct the 
vector bundle E{V) R{M) by E{V) := F+{M) x^V. 

The action of Diff"(M) on F+(Af) induces an action by automorphisms of 

Diff"(M) on F+{M), and therefore also on R{M) and E{V). (It is just the 
integrated version of the natural lift of T{TM).) This makes T{E{V)), the 
space of smooth sections of EiV) — )■ R{M), into a representation of Diff°(M). 

If V has an i?- invariant (bilinear or sesquilinear) form ( • , • ), then this 
induces a form on each fibre of E{V) R{M). This defines a form {(j>\ip) = 
^R(M)^^^ V)^^ on T[E(y)). Because vol is everywhere nonzero, the form ( • | • ) 
is positive definite and/or nondegenerate if and only if ( • , • ) is. Moreover, the 
form ( • I • ) is DifF° (M)-invariant because the volume is. It is perhaps worth 
while to summarize this in a corollary. 

Corollary 7.28 Let H < SL(M") be an open subgroup of the fixed point set of 
an involution. Let H be its preimage in G{1,M), and let V be a representation 
of H with invariant form (•, •). Then the induced representation V{E{V)) of 
I)ifF°(M) has an invariant form = /r(m) ("^^j '^)^°^' Diff*^- 

invariant volume on R{M). It is nondegenerate and/or positive definite if and 
only if {■ , ■) is. In particular, any unitary representation of H induces a unitary 
representation o/DifF°(M). 

In the case that ^ is a spinor type representation of S0(?7) x ^tti {F), the induced 
representation on T[E(y)) is only unitary if is positive definite. On the other 
hand, an invariant sesquilinear form exists for any signature. 

Decomposition into Irreducibles 

These induced representations will in general not be irreducible. We sketch the 
decomposition into irrcps, in the particular case that M is a parallelizable, com- 
pact, connected manifold of dimension > 2, and is a unitary representation 
of H that factors through H. (So that the induced representation will factor 
through Diff'^M).) The problem then essentially reduces to harmonic analysis 
on the symmetric space A = SL(M")/i/. 

We fix once and for all a global section f : M ^ F^{M) of the frame 
bundle. It identifies F+{M) with M x GL+(M"), by way of {x,g) ^ f^g. If 
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a* : TxM T^^^^M is the pushforward, then we define D^a to be the matrix 

of a, w.r.t. the frames at x and a{x), i.e. D^j^a := f~} , o o f^. The action of 

a on M X GL+(M") is then a : {x,g) ^ {a{x),Dxa ■ g). 

We identify 1R+ x SL(M") with GL+(R") by (A,g) ^ y/Xg. With A = 
SL(E")/i7, wc then have i?(Af) = Af x E+ x A. The action of a on R{M) is 
a : {x, A, w) h-> {a(a;), det(£'a;a), D^a ■ w), with D^a := D^a/ i^det(L',,.a). 

If voIm is the volume on M defined by the frame, and voIa is the unique (up 
to M^) SL(IR")-invariant volume on A. then it is not hard to sec that the unique 
(up to M'^) Diff°-invariant volume on R{M) is vol = voIm A dA/A^ A voIa- 

The bundle E{V) is simply M x M+ x EaCV), with EAiV) the vector bun- 
dle SL(M") Xh V over A. Its space H of L^-sections constitutes a unitary 
representation of SL(M"), the induced representation of V. 

We identify the space of L^.gections of E(y) with L'^{M, L'^(R+)iSi'H), where 
M+ is equipped with dX/}?, and M with voIm- The representation of Diff°(M) 
is then given by 

{aiij){x) = p(dct(i:»^Q;-i))-i (g) T:{b^a-^)-^ti}{a-^{x)) , 

with p the nonunitary M+-representation {p{c)f){X) = f{c~^X) on the Hilbert 
space L^(]R+, dX/}?), and tt the unitary SL(M")-representation on y.. 

We decompose p into irreducible representations. The unitary transforma- 
tion W : L'^{M.+ ,dX/X^) L^{R,dt) defined by {Wf){t) = e-*/2/(e*) takes 
p into p'{c) = WpW-^{c), with (p7')(*) = c'^^'^f'it - log(c)). The Fourier 
transform T : L'^{R,d.t) L'^{R,dk) then takes p' into /3(c) = J"p'(c)J"-\ 
with {p[c)f){k) = c^^^^^^''f{k). We conclude that the overall transformation 
J^oW: i2(K+, dA/A^) ^ L^{R, dk), given by 

f{k) = / X'/'-''f{X)dX/X^ , 
V i-K Jo 

identifies p with the representation {p{c)f){k) = c~^/'^~^^ f{k) on L^(R, dfc). 
We write L'^ {M + , dX / X^) ~ J® Ckdk and p ~ Pkdk, with (pfe,Cfe) the one- 
dimensional representation pk{c)z = c~2~*'^z. (See e.g. [Dix69, ch. II] on direct 
integrals.) 

Consequently, the Difi'°(Af )-representation L'^{M x M+,?^), with measure 
voIm a dX/X^, is unitarily equivalent to L?{M x M, "H), with measure voIm A dk 
and representation 

(aV')(a;, k) = det(£)^a-^)^/2+''=7r(£>^a-^)- ^(""^(a;), k) . (7.15) 

We thus have a decomposition (M, (M+) o ^) ~ J® (M, Cj, O n)dk into 
subrepresentations, where the unitary Di0'°(Af)-reprcscntation on the Hilbert 
space L^(Af, Cfe (8) "H) is given by equation (7.15), but now with fixed k. 

In general, the representations L'^ {M, Ck (8) H) will still be reducible. Now 
suppose that we can decompose H into unitary irreps as = 7ipiJ,{d(3) for 
some measure space {B,p), with tt = TT^p^dj}). Then 

L2(M,L2(K+),g)^)c:, r L^{M,Ck(E>nfi)dkp{dl3), (7.16) 

JrxB 
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with the representation on L^(M, Cfc (g) 'Hp) given by 

= det(D,a-i)i/2+^%(£),a-i)-V(a"'(a;)) • (7.17) 

According to a theorem stated (but not proven) in [Ism72], these represen- 
tations of Diff"(M) are irreducible, at least under the assumption that M is a 
compact manifold of dimension at least 2. 

This means that the problem of decomposing the L^-closure of T[E{y)) 
into irreducible representations of Diff°(M) reduces to that of decomposing the 
L^-closure of T{Ei^{V)) into irrcps of SL(M"). Although this is not an easy 
problem, it is one that has been studied in considerable detail. 



Example: The Hyperbolic Plaine 

We explicitly carry out the decomposition in the special case that M is the 
2-torus T2 with global frame (^, J^), the subgroup is H is S0(M2), and F = C 
is the trivial representation. 

We identify the symmetric space SL(IR^)/S0(IR^) with the open unit disk 
A = € C; < 1}. It comes equipped with the SL(M-^)-action g ■ z = 
A{g)z + B{g)/{B{g)z + A{g)), where A{g) and B{g) are defined by 

A B\ 1/1 -i\ (\ 1 



B A) 2 



1 -i ■ (^-l^) 



(This is the identification of SU(1, 1) with SL(R^) that comes from the complex 
isomorphism of A with the upper half plane.) The SL(M^)-invariant volume on 
A is 

idz A dz 
^^'- = W^- 

Since Ea{V) is the trivial bundle A x C, its space of L^-sections is L'^{A, voIa). 
It decomposes into the spherical principal series representations np, which we 

will define shortly. 

The unit circle = {ui € C ; = 1}, equipped with the rotation invari- 
ant measure dij/{2ni(jj), carries an SL(R^)-a€tion oj g ■ u because it is the 
boundary of A. Define the Fourier transform J" : i^(A) L^(R+ x S^) by 

(^/)(/j,.)=y^/(.)(^) voiA. 

If we equip M"*" with the measure ii{d(i) = (27r)^^/3 tanh(|7r/3)d^, then it is a 
unitary isomorphism [Hel84, p. 33]. By direct calculation, one verifies that 

(7-7r(ff-i)^-V') = \B{g)uj + A{g)f0-'f{p,g-co), 

with A{g) and B{g) as in equation (7.18). In other words, the Fourier trans- 
form identifies the representation of SL(M^) on L^(A) with the direct integral 
'^i3t-i'(dd), where is the spherical principal series representation of SL(M^) 
on L^S^) defined by {irpig-')/) (w) = \Big)Lo + ^(5)^^-1 f{g ■ co) . 
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In this particular case, equation (7.16) means that the DifF -representation 
on the L^-sections of E{V) is unitarily equivalent to L'^{T'^ x x M+ x M"*"), 
with the measure d(p A d9 A (iw/(27rzw) A dk A (27r)~^/3 tanh(7r/3/2)ii/9, and the 
representation 

{iT{a)tlj){x,u;,k,^) = Ck,fi{D:,a~'^)ip{a~'^{x), D^a~'^ • , 
with the cocycle (A and B as in (7.18)) 

CkADxa-^) = det(Z),a-i)i/2+ifc|B(D,a-i)w + A{D,a-')f-^ . 

(There is no need for the projection D^a"^, as the scalars act trivially on A.) 

The unitary representations of DifF'^(T^) that one gets when fixing k and /3 
are irreducible according to [Ism72]. 

7.7.5 Some Physical Theories 

If we accept that any physically relevant SpinG-structure must be infinitesimally 
natural, and that G should be the gauge group of the theory, then theorem 7.23 
provides a link between the spectrum of elementary particles and the global 
topology of space-time. 

The fact that the mere existence of a generalized spin structure may place 
restrictions on the space-time manifold was recognized by Hawking and Pope 
[HP78]. Generalized spin structures were classified [AI80], and it was found that 
if the Lie group contains SU(2), then 'universal spin structures' exist [BFF78], 
irrespective of the topology of M. In particular, there are no topological ob- 
structions to the existence of a SpinG-structure as soon as SU(2) < G. 

But according to tlicorein 7.23, this changes if one requires the SpinG- 
structure to be infinitesimally natural. Universal spin structures then exist only 
for certain noncompact groups. For compact G, the requirement that there ex- 
ist a hoinoinorpliisin tti{F^ (M)) G fixing Z provides an obstruction on the 
space-time manifold M in terms of the group of internal symmetries G. Let us 
see what this means in some specific cases. 

Weyl Spinors 

Consider a single massless charged Weyl spinor coupled to a U{1) gauge field. 
For simplicity, let us assume that M is 4-dimensional, oriented, and time- 
oriented, so that Z ~ Z/2Z and we may use SL(C^) instead of SO{rj). 

This means that G = U{1) and V = C'^ (S) Cg, the two-dimensional defining 
representation of SL(C^) tensored with the one dimensional defining represen- 
tation of U(l). 

Note that the representation descends to Spin*^ = SL(C^) Xz U{1). (Z is 
just ±1 in ?7(1).) This means that upon choosing a Spin'^-structure Q, the 

configuration space is r(J^((5) Xgpinc V). 

Let us now impose the requirement that Q be infinitesimally natural. The- 
orem 7.23 tells us that there must then be a homomorphism 7ri(i<'+(M)) — )• 
C/(l) sending Z/2Z to ±1. If 7ri(Af) is finitely generated, then the image 
of 7ri(i^+(M)) in is a finitely generated subgroup containing ±1. It 



189 



must be isomorphic to Z" x {^/2m'L) for some n,m € N. If we then send 

Z" to 1, we obtain a homomorphism 'ni{F'^ {Ad)) — Z/2mZ. The sequence 
1 — > Z/2Z ^ Z/2toZ — )■ ItjrnL — )■ 1 splits precisely when m is odd. 

We conclude that an infinitesimally natural Spin°-structure exists on M if 
and only if there is a surjective homomorphism 7ri(i^+(Af)) — > IjlrnL which 
preserves TLjlTL. Only if ni is odd does this give rise to a spin structure. 

Dirac Spinors 

Next, consider the case of a Dirac spinor. That is, F = C' ig) Cg, with the 
representation of C^C*) which splits into two identical irreps ® under 
SL(C^); the left handed and right handed spinors. 

The fact that V is reducible under Spin'^ makes us re-examine our assumption 
that the group G in theorem 7.23 should be the gauge group UiV)- Indeed, the 
unitary commutant of SL(C^) in V is [/(2) rather than [/(I). If we take any 
discrete subgroup H < U{2) and form the group U{1)h generated by H and 
U{1), can we take Q to be a SpinG-structure with structure group [/(l)^? 

As far as only the kinematics is concerned, the answer is yes. The generic 
fibre of J^{Q) XgpjjjC V is the same for G = U{1) as it is for G = U{1)h, so 
adding H will not change the space of local sections. 

But if we take into account the dynamics, the answer becomes: 'only if H 
is a group of symmetries of the Lagrangian'. The reason is that Lagrangians 
are usually defined in local co-ordinates, yielding a local action Su for each 
co-ordinate patch U C M . However, as the dynamics of the theory should be 
governed by a global action functional Sm, it is necessary for Su and Sy to agree 
on U (IV. This means that if H is part of the structure group of the bundle, 
then it must leave the Lagrangian invariant. If _ff is a global symmetry, then 
the transition functions must be constant. This is automatic if H is discrete. 

For instance, in the case of a massive Dirac fermion, the subgroup of U (2) 
which preserves the Lagrangian is precisely the diagonal U{1). This means that 
the relevant SpinG-structures are precisely the Spin'^-structures classified above. 

For massless Dirac spinors, the left and right Weyl spinors decouple, so that 
the relevant symmetry group is Ul{1) x Uji{l). Although the requirement on 
a manifold to carry a SpinG-structure does not change, this does give us more 
SpinG-structures for the same manifold. 

This illustrates that we may enlarge the gauge group G by any group of 
discrete symmetries of the Lagrangian in order to obtain SpinG-structures. In 
particular, this means that infinitesimally natural Spin'^-structures are allowed 
even for uncharged Weyl spinors. (The image of 7ri(F+(M)) is automatically 
discrete.) 

The Standard Model 

In the the standard model of elementary particle physics, the gauge group 
G is (SU(3) X SU(2)l x U(l)y)/iV, with N the cyclic subgroup of order 6 
generated by {e'^'''^^l, -1, e^""-/*^). It is isomorphic to S{U{3) x U{2)), a sub- 
group of SU(5), and it has a unique central subgroup of order 2 generated by 
diag(l,l, 1,-1,-1). 
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The fermion representation V for a single family can be conveniently de- 
scribed (see e.g. [BHIO]) by (E) A*C^, the tensor product of the defining 
representation of SL(C^) and the exterior algebra of the defining representation 
of SU(5). Under SL(C2) x S{U{3) x U{2)), this decomposes into 12 irreps cor- 
responding to left and right handed electrons, neutrinos, up and down quarks 
and their antiparticles. 

Unfortunately, diag(l, 1, 1,-1,-1) € G acts by +1 on right-handed fermions, 
whereas — 1 (E SL(C^) acts by —1. This means that V does not define a repre- 
sentation of SL(C^) X z S{U{'6) X U{2)) if one were to identify the central order 
2 elements on both sides. 

As the gauge group alone is of no use when trying to find a SpinG-structure, 
one has to involve the group of global symmetries of the standard model La- 
grangian. It contains the gauge group G, but also (at least on the classical level) 
the global U{1)b x J7(l)L-symmetrics that rotate quarks and leptons indepen- 
dently. (These are connected to baryon and Icpton number.) 

We conclude that the only infinitesimally natural SpinG-structures relevant 
to the standard model are the ones associated to homomorphisms 

7ri(F+(M)) ^ G (7.19) 

that preserve Z, the subgroup of U{1)b x U{1)l generated by (—1,-1). In 
this expression, G is the group of global symmetries of the standard model 
Lagrangian, which at least contains S{U{3) x U{2)) x U{1)b x U{1)l- 

Infinitesimally Natural Spin'^-Structures for the Standard Model 

It seems that any manifold which possesses an infinitesimally natural SpinG- 
structure for the standard model automatically permits an infinitesimally nat- 
ural Spin'^-structure. (This is at least the case if the group G of global symme- 
tries is S{U{3) X U(2)) X U{1)b x U{1)l-) On the other hand, there do exist 
SpinG-structures for the standard model which are not Spin'^. We construct an 
example. 

Consider de Sitter space H = {x G — + + x?, + xl + xl = 1}, 
which has a pseudo-Riemannian metric g with constant curvature induced by 
the Minkowski metric in the ambient M.^. Its group of orientation preserving 
isometrics is S0(l,4), and H ~ S0(1, 4)/S0(l, 3). Denote by OF+^{H) the 
bundle of orthogonal frames with positive orientation and time-orientation. By 
viewing OF+^{H) as a submanifold of x S0(1, 4)°, one can see that S0(1, 4)° 
acts freely and transitively by x : / i— >■ a;*/. Therefore OF^^{H) is diffeomor- 
phic to S0(l,4)". 

Now let r < SO (4) be a discrete group which acts freely, isometrically and 
properly discontinuously on S^. The manifold T\S'^ is called a spherical space 
form. (See [Wol67] for a classification.) As V includes into S0(l,4)°, it acts 
on H, making M = Y\H into a pseudo-Riemannian manifold with constant 
curvature. 

We immediately see that 7ri(M) = F, because H is simply connected. We 
calculate the homotopy group of the frame bimdle. Because OF^^ (M) is just 
V\OF^'^ {H), it is isomorphic to r\S0(l,4)°. Going to the universal cover, 
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we see that OF+{M) = f\S0(l,4)°. As T < S0(4), we may consider f to 
be the preimage of F in Spin(4). As the universal cover is simply connected, 
it is now clear that 7ri(OF+^(M)) = F. We get for free a homomorphism 
r — )• Spin(4) ~ SU(2)i x SU(2)r, which maps the noncontractible loop in the 
fibre to (-1,-1). 

Triggered by the WMAP-data on cosmic background radiation, there has 
been some interest in the case where F is /*, the binary icosahedral group 
[LW+03]. Some evidence against [KC+07] as well as in favour of [RB+08] this 
hypothesis appears to have been found. We carefully avoid choosing sides in the 
debate, and simply point out that I*\H gives rise to an interesting infinitesi- 
mally natural SpinG-structure. 

Under the identification Spin(4) ~ SU(2)i x 811(2)^, we see that F = /* x 1 
lives only in SU(2)i, so that F is the direct product of /* x 1 and the Z/2Z 
generated by (—1,-1). One can therefore define a homomorphism (7.19) by 
identifying SU(2)( with SU(2)l < G, and mapping (-1,-1) to (-1,-1) G 
U{1)b X U{1)l- This yields an infinitesimally natural SpinG-structure which 
uses the noncommutativity of the gauge group in an essential fashion. Note 
however that 'ordinary' Spin'^-structures also exist on M. 

Extensions of the Stcindcird Model 

The fact that S{U{3) x U{2)) does not contribute to the obstruction of finding 
SpinG-structures on M is due to the fact that it never acts by —1 on V. This 
is not true for some GUT-type extensions of the standard model, such as the 
Pati-Salam SU(2)l x SU(2)i{ x SU(4) model and anything which extends it, for 
example Spin(lO). 

If N is the group of order 2 generated by (— 1,— 1,— 1), then infinitesimally 

natural SpinG-structurcs in the Pati-Salam model correspond, neglecting global 
symmetries, to homomorphisms ni{F+{M)) -J> SU(2)l x SU(2)/{ x SU(4)/iV 
which take Z to ((—1, —1, 1)). This has the rather intriguing consequence that 
there may well exist space-time manifolds M which are compatible with the 
Pati-Salam model, but not with the standard model. A manifold M would have 
this property if the smallest quotient of tti{F{M)) containing Z is a nonabelian 
subgroup of SU(2)l x SU(2)fl x SU(4) containing (-1, -1, 1). 

7.8 Discussion 

A natural bundle is one in which diffeomorphisms of the base lift to automor- 
phisms of the bundle in a local fashion. In corollary 7.15, we have rederived the 
well known [PT77] result that any natural principal fibre bundle is associated 
to the fc"^ order frame bundle F^{M). 

Bundles associated to spin structures are almost never natural, but nonethe- 
less indispensable to physics. We would therefore like to extend the notion of 
'naturality'. One way to do this is to introduce 'gauge natural bundles' [EckSl] 
[KMS93], which do not transform according to diffconiorphisnis of the base M, 
but according to automorphisms of a bundle over M which has to be specified. 
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We here propose to accommodate spin structures in a different fashion. Al- 
though the Lorcntz group does not act on a spin structure, its associated Lie 
algebra does. Analogously, we define an 'infinitesimally natural' bundle over 
M to be a bundle in which it is not the group of diffeomorphisms of M that 
lifts, but its Lie algebra of vector fields. This has the advantage that the link 
between the base and the fibres is not lost, so that a stress-energy-momentum 
tensor can be constructed from Noether's principle [Noel 8]. 

We assume only that the lift is a homomorphism of Lie algebras. However, 
the careful analysis by Gotay and Marsden [GM92] reveals that in order to 
define a SEM-tensor from Noether's principle, one needs a lift that is given by a 
differential operator. Assuming only that the lift is a homomorphism, we prove 
in proposition 7.7 that it is actually a differential operator . This shows that one 
can construct a SEM-tensor from Noether's principle precisely when the bundle 
is inffnitcsimally natural. Wc thcrc^forc propose to describe fields on space-time 
by sections of inffnitcsimally natural bundles. 

We have classiffcd the infinitesimally natural principal fibre bundles. Ac- 
cording theorem 7.14, they are associated to the universal cover F^^{M) of 
the fc"^ order frame bundle, and their transformation behaviour is governed by 
the disconnected group G{k,M). This group depends on the manifold. It gen- 
eralizes the Pin group in the sense that it regulates parity, time reversal and 
Lorentz transformations. 

Theorem 7.14 is originally due to Lecomte [Lec85]. Although our proof 
was obtained independently, it does rather resemble Lecomte's. We pause to 
highlight a few differences. First of all, Lecomte does not work in the compactly 
supported setting: he bases his version of lemma 7.5 on results of Amemiya 
[Ame75] rather than on Pursell and Shanks' lemma 7.2.1. Secondly, our uniform 
bound, in proposition 7.7, is different from Lecomte's. But the most significant 
difference is probably that Lecomte bases his proof on integration of foliations 
rather than on integration of Lie algebroids, which probably makes our proof 
easier to generalize. Strikingly enough, Lecomte's work does not appear to be 
motivated by physics; he mentions spin structures only briefly. 

The consistent description of fermions in the presence of a gauge group G 
requires a SpinG-structure rather than a spin structure [HP78], [AI80]. All 
spin structures are inffnitcsimally natural, but some SpinG-structures are not. 
As described in theorem 7.23, infinitesimally natural SpinG-structures for com- 
pact G correspond to homomorphisms 7ri(_F+(Af)) — >■ G that are injective on 
7ri(GL+(R")). 

The existence of infinitesimally natural SpinG-structures therefore provides 
an obstruction on space-time M in terms of the symmetry group G. Some 
manifolds are even excluded for any (compact) choice of G. For example, CP^ 
does not admit any infinitesimally natural SpinG-structure because its frame 
bundle is simply connected. In our eyes, this disqualifies it as a model for 
space-time. 

If all global symmetries are gauged, then theorem 7.23 constitutes the "con- 
nexion between the topology of space-time and the spectrum of elementary 
particles" alluded to in [HP78]. 
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Samenvatting 



Hot meetproces in de kwaiituiiiiiicclianica is ccn wat bizarre aangelegcnlicid. 
lets wat in het dagelijks leven volstrekt vanzclfsprckciid is. zoals het gelijktij- 
dig nieten van twee verschillende dingen, kan op kleine schaal opeens volstrekt 
onmogelijk blijken. 

In hoofdstuk 3 en 4 is een aantal van dit soort fundamentele beperkingen 
zo scherp mogelijk geformuleerd. Je kunt immers moeilijk je grenzen opzoeken, 
als je niet weet waar die liggen. Vervolgens is voor een tweetal meetmethoden 
vastgesteld hoeveel verbetering er theoretisch nog mogelijk is. Het zijn allebei 
methoden om de inwendige toestand van een atoom vast te stellen aan de hand 
van het licht dat het uitstraalt. De methode van hoofdstuk 6 blijkt optimaal te 
zijn, die van hoofdstuk 5 laat nog een klein beetje ruimte voor verbetering. 

Hoofdstuk 7 gaat over een ander deel van het onderzoek. AUe velden die in 
de natuur voorkomen (elektrische velden, magnetische velden, . . . ) bewegen mee 
met kleine vervormingen van de tijdruinite. Het belangrijkste resultaat in dit 
deel van het proefschrift is een classificatie van alle 'meebewegende' velden die 
theoretisch mogelijk zijn. Verrassend genoeg zijn dat er in zekere zin tamelijk 
weinig, en bovendien hangen de mogelijkheden sterk af van de vorm van de 
tijdruimte. 

In de rest van deze samenvatting wil ik graag zander wiskundige formules een 
preciezer beeld geven van de inhoud van dit proefschrift. Ik beperk me daaxbij 
tot de wiskundige ideeen in hoofdstuk 7, omdat die het best uit te drukken zijn 
in woorden en vooral ook plaatjes. 

Elektrische Velden en de Raakbundel 

Een elektrisch veld is lets wat trekt aan geladen deeltjes. Hoe sterker het elek- 
trisch veld, hoe harder de deeltjes in de richting van dat veld getrokken worden. 




Fig. 30: Een elektrisch veld zet geladen deeltjes in beweging. 
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Dit is bijzonder handig, bijvoorbeeld als je een broek wilt strijken. Je neemt 

dan ccn dunne metalen plaat, die vol zit met geladen deeltjes (elektronen). 
Plaats deze bovenop de te strijken broek. Zorg vervolgens voor een elektrisch 
veld in de metalen plaat, bijvoorbeeld m.b.v. een stopcontact. De geladen deel- 
tjes worden in de richting van het veld door de plaat getrokken, die door alle 
wrijving opwarmt. Door vervolgens de plaat zachtjes been en weer over de broek 
te bewegen kan men vouwen en kreukels verwijderen, of deze er juist in strijken 
(in het geval van de auteur vooral dat laatste). 

De Ruimte van Punten 

Laten we proberen om het elektrisch veld wiskundig te beschrijven. Stap een 
is een beschrijving van de ruimte M van punten waar het veld kan leven. Dit 
vraagt om een zekere mate van algemeenheid. Wat M precies is, hangt immers 
af van de situatic. 

In een dunne metalen plaat kan ieder punt bijvoorbeeld beschreven wor- 
den door 2 coordinaten, lengte (a;) en breedte (y). De ruimte M heet dan 2- 
dimensionaal. Maar elektrische velden leven ook in de aarde, in de lucht en zelfs 
in de lege ruimte. Je hebt daar 3 coordinaten nodig om een punt te beschrijven: 
lengte (a;), breedte (y) en hoogte {z). De ruimte M is dus 3-dimensionaal. Won- 
derlijk genoeg houdt het hier niet op. Volgens Einstein zijn tijd (1-dimensionaal) 
en ruimte (3-dimensionaal) 'van hetzelfde spul gemaakt'. Er zijn dus 4 coordi- 
naten nodig om een punt in de tijdruimte te beschrijven: lengte (a;), breedte (?/), 
hoogte {z) en tijd {t). Met andere woorden: de tijdruimte M is 4-dimensionaal. 




Fig. 31: Gladde varieteiten van dimensie 1 (links) en 2 (rechts). 

Om dit alles in een klap te vangen, beschrijven we de ruimte M van punten 
als een gladde varieteit (smooth manifold). Gladde varieteiten heb je in elke 
gewenste dimensie. Een gladde varieteit van dimensie 1 is niets anders dan een 
(kromme) lijn. Een gladde varieteit van dimensie 2 is een (gekromd) oppervlak. 

Zoals gezegd is onze tijdruimte M een gladde varieteit van dimensie 4. Dit is 
lastig te tekenen, zeker aangezien de tijdruimte ook nog eens gekromd is. Maar 
wiskundig gezien zijn 4 dimensies niet vecl inocilijker dan 1 of 2. Bij woorden 
als 'gladde varieteit', 'ruimte' of 'tijdruimte' kan men dus gerust figuur 31 in 
het achterhoofd houden. 
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De Raakbundel 

Stap een was het beschrijven van de ruimte M van punten. Stap twee is het 
veld in een vast punt^ x van M. Het veld geeft aan hoe hard er aan een geladen 
deeltje wordt getrokken. Hoe sterker het veld, hoe groter de trekkracht. Maar 
daarnaast bepaalt het veld ook in welke richting er wordt getrokken. Het veld 
in X is dus lets wat naast grootte ook richting heeft: een pijl of vector. Hij moet 
natuurlijk raken aan M, omdat de deeltjes anders de ruimte zouden verlaten. 
Kortom: het elektrisch veld in een punt x beschrijven we wiskundig als een 
vector die aangrijpt in x en raakt aan M. 




Fig. 32: Links een gladde varieteit M van dimensie 2. In het midden een raakvector 
die aangrijpt in het punt x. Rechts het raakvlak in dit punt. Alle vectoren die raken 
aan M en aangrijpen in x hcbbcn hun kop in dit vlak. 

Voor de derde en laatste stap in de beschrijving van het veld moeten we ons 
realiseren dat het elektrisch veld niet louter leeft in een punt. Voor ieder mogelijk 
punt in de ruimte bepaalt het hoe hard er aan geladen deeltjes getrokken wordt. 
We beschrijven het veld dan ook als een hele familie vectoren, een raakvector 
die aangrijpt in x voor elk punt x van de ruimte M. 




Fig. 33: De bolschil is een 2-dimensionale gladde varieteit. Het elektrisch 
veld hierop wordt beschreven door een snede van zijn raakbundel. Dit is iets 
wat in ieder punt x een raakvector uitkiest: de waarde van het veld in x. 

Alles bij elkaar geeft dit een wiskundige beschrijving van het elektrisch veld. 
De ruimte waarop het veld leeft, beschrijven we met een gladde varieteit M. 

^Wc sclirijvcii kortwcg x in plaats vaii {x,y), {x,y,z) of {x,y,z,t). Op een heel proefschrift 
scheelt dat alras een bladzij. 
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Bij ieder punt x van M hoort een raakvlak (tangent space) T^M, de ruimte van 
alle vectoren die raken aan M en aangrijpen in x (zie figuur 32). Het raakvlak 
in X beschrijft precies alle mogelijke waarden die het elektrisch veld in x kan 
aannemen. We definieren de raakbundel (tangent bundle) als de familie van alle 
mogelijke raakvlakken T^M, waar x loopt over de punten van Af . 

We beschrijven het elektrisch veld als een snede van de raakbundel. Dit is 
lets wat in ieder raakvlak T^M precies een vector uitkiest: de waarde van het 
veld in x. 

Algemene Velden en Vezelbundels 

Het elektrisch veld is niet het enige veld. Zo bestaat er bijvoorbeeld een druk- 
veld, dat als het ware bijhoudt in hoeverre een rubber balletje zou worden 
samengedrukt als het de niocd had op een bepaalde plek in de ruimte zijn bol 
gezicht te vertonen. Een trilling in het drukveld ervaren wij als geluid. Ook het 
elektrisch veld kan trillen: dit doet het samen met zijn tweelingbroer, het mag- 
netisch veld. Rond 1864 ontdekte J.C. Maxwell^ dat licht precies bestaat uit dit 
soort gecombineerde elektromagnetisch trillingen. 

Maar ook in de moderne natuurkunde spelen velden een hoofdrol. Het hele 
idee van een 'fundamenteel deeltje' is daar zelfs vervangen door dat van een 
'fundamenteel veld' ! Deeltjes zijn dan niets anders dan trillingen in dit veld. 

Het loont dus de moeite om de voorgaande bespiegelingen lets algemener te 
trekken, en een wiskundige beschrijving te geven voor velden in het algemeen. 
Dit hebben we gedaan in hoofdstuk 2.4. Een sleutelrol is daarbij weggelegd voor 
het begrip vezelbundel (fibre bundle), dat de raakbundel generaliseert. 



Fig. 34: Schets van een vezelbundel. De basisruimte M is de kromme bene- 
den. Bij ieder punt x op de kromme hoort een vezel Fx- Alle vezels hebben 
hier de vorm van een schijf. 

Een vezelbundel over een gladde varieteit M is per definitie een familie van 
vezels, een vezel bij ieder punt x van M. De vezel bij het punt x stelt de 
verzameling voor van waarden die het veld mag aannemen in x. Die vezels 

^Hij publiceerde zijn bevindingen in "A Dynamical Theory of the Electromagnetic Field", 
Phil. Trans. R. Soc. Land. 155, biz. 459—512, 1865. Maxwell is veel geprezen voor zijn 'ontdek- 
king van het licht', lets wat voor een man met die initialen nochtans geen grote opgaaf zal 
zijn geweest. 
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mogen van allcs zijn, maar we eisen wel dat alle vezels er 'hetzelfde uitzien' (in 
jargon: isomorf zijn). 

Een belangrijk voorbeeld van een vezelbundel is de raakbundel. De vezel 
Fj. bij een punt x wordt hier gevormd door het raakvlak T^M, de ruimte van 
alle vectoren die raken aan M en aangrijpen in het punt x. Alle vezels zien er 
hetzelfde uit, in dit geval als vlakken. 

In het algenieen beschrijven we een veld als een snede (section) van de ve- 
zelbundel. Een snede is lets wat in iedere vezel precies een waarde uitkiest. 




Fig. 35: Schcts van con sncdc. In icdcrc vczcl is ccn waarde aangcwczcn. 

De wiskundige beschrijving van een veld werkt dus als volgt. De ruimte 
waarop het veld leeft, beschrijven we met een gladde varieteit M. Het soort 
veld (bijvoorbeeld: het elcktrisch veld) wordt bepaald door een vezelbundel (bij- 
voorbeeld: de raakbundel). Bij ieder punt x van M hoort dan een vezel F^ van 
waarden die het veld daar aan kan nemen (voor de raakbundel is dit het raak- 
vlak TxM). Het veld zelf wordt beschreven door een snede van de vezelbundel. 
Die wijst in iedere vezel F^ een waarde aan: de waarde van het veld in x. (Een 
snede van de raakbundel wijst bij ieder punt dus een raakvector aan.) 

Natuurlijke Bundels 

Alle vezelbundels zijn gelijk, maar sommige vezelbundels zijn gelijker dan an- 
dere. Een vezelbundel heet natuurlijk als iedere vervorming (jargon: diffeomor- 
fisme) van de ruimte M een welbepaald effect heeft op zijn vezels. (Zie definitie 
7.1 op biz. 154 voor een preciezere formulering.) Je kunt de natuurlijke bundels 
zien als een soort 'elite' onder de vezelbundels. 




Fig. 36: Een raakvector draait mee als je het vlak draait. 
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Elke raakbundel is natuurlijk. Bekijk bijvoorbeeld het platte vlak M, met 

een raakvector die aangrijpt in een punt x van M. Vervolgens draaicn wc hot 
vlak. Wat gebeurt er met de vector? Het antwoord is duidelijk: de vector beweegt 
mee als je M roteert, zie figuur 36. De raakbundel heet 'natuurlijk' omdat elke 
vervorming van de ruimte M een welbepaald effect heeft op de raakvectoren. 

Dit heeft belangrijke gevolgen voor het elektrisch veld, dat wordt beschreven 
door een snede van de raakbundel. Bij een vervorming van de ruimte M is er een 
welbepaald effect op de raakvectoren in ieder afzonderlijk punt van M. Omdat 
een elektrisch veld niets anders is dan een manier om aan alle punten van M 
een raakvector in dat punt toe te kennen, moeten we concluderen dat het gehele 
veld meeverandert als M wordt vervormd. Zie figuur 37. 
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Fig. 37: Ook het veld draait mee als je het vlak draait. 

Aan het einde van hoofdstuk 2.4 wordt uitgelegd dat het meebewegen van 
het veld bij kleine vervormingen van M noodzakelijk is om de energie (jargon: 
de 'stress-energy-momentum tensor') van een veld te definieren. 

Het feit dat elektrische en magnetische velden energie kunnen bevatten, hoeft 
niet te verbazen. Licht is immers niets anders dan een golf in het elektromag- 
netisch veld. ledereen die wel eens roodverbrand onder de zonnebank vandaan 
is gekomen, heeft het energiedragend vermogen van het elektromagnetisch veld 
dus experimenteel vastgesteld, om niet te zeggen aan den lijve ondervonden. 

Energietransport door het elektromagnetisch veld heeft trouwens ook bijver- 
schijnselen die prettiger zijn dan eerstegraads brandwonden. Leven op aarde, om 
eens wat te noemen. De energie die het leven hier mogelijk maakt, komt direct 
of indirect van de zon. Maar als het elektromagnetisch veld geen energie zou 
dragen, dan zou al die zonne-energie vrolijk op de zon achterblijven. De aarde 
zou dan koud en levenloos achterblijven. 

Enfin, elektrische velden kunnen dus, samen met magnetische velden, energie 
overdragen. Dit komt doordat de raakbundel natuurlijk is. Hoe zit dat met 
andere velden? 

We weten uit de natuurkunde dat velden eigenlijk altijd in staat zijn energie 
te dragen. Het ligt dus voor de hand om velden te beschrijven met natuurlijke 
vezelbundels. Het eerste voorbeeld van een natuurlijke vezelbundel hebben we 
dus al gezien: de raakbundel. 

Natuurlijke bundels zijn witte raven onder de vezelbundels. Een algemene 
vczclbTindcl is zomaar een familie vezels Fj., zonder cnigc vcrdcre structuur. In 
het algemeen heb je geen flauw idee hoe de vezels zouden moeten meeveranderen 
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als de ruimte M vervormt, en is het zelfs nog maar de vraag of dit hoe dan ook 
wel op een consistente manier kan. Als een vezelbundel daarentegen natuurlijk 
is, heeft elke vezel als het ware een band met zijn bijbehorend punt x. Als 
de ruimte M vervormt, 'voelen' de vezels hoe hun eigen punt beweegt, en 
bewegen mee, vgl. figuur 36 en 37. 

Infinitesimacd Natuurlijke Bundels 

Een bundel verdient dus de eretitel 'natuurUjk' als elke vervorming van de ruimte 
M een welbepaald effect heeft op de vezels F^. Die eis is soms een bcetje al te 
zwaar. We noemen een bundel daarom infinitesimaal natuurlijk als elke kleine 
vervorming van M een welbepaald effect heeft op de vezels F^. (Zie ook defi- 
nitie 7.2 op biz. 154.) Dat is een soort troostprijs. Als je een bundel bent, is 
infinitesimaal natuurlijk zijn niet zo goed als echt natuurlijk zijn, maar het is 
toch al heel wat. Voor het definieren van energie zijn bijvoorbeeld alleen kleine 
vcrvoriniiigcii nodig. Met andere woorden, een veld kan energie dragen precies 
dan als hij hoort bij een infinitesimaal natuurlijke bundel. 

De zogenaamde spinorbundels zijn infinitesimaal natuurlijke vezelbundels bij 
uitstek. De spinorbundel bij het platte vlak M is precies de raakbundel, bchalve 
dat vectoren op halve snelheid met de ruimte meedraaien, zie figuur 38. Zo een 
halfslachtige vector heet ook wel een spinor. 




Fig. 38: Een spinor draait mee met de ruimte, maar half zo snel. 

Als de ruimte 360° om haar as draait, is or netto helemaal niets gebeurd. Als 
je immers een heel rondjc draait, is daariia alles weer precies zoals het geweest 
is. In figuur 38 valt cchtcr to zicn dat de spinor desondanks is omgeklapt, van 
l naar ^. Dat is ccn bcctjc nicrkwaardig, en dczc kleine ongcrijmdhcid komt de 
arme spinorbundel dan ook op verlies van het predicaat 'natuurlijk' te staan. 
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Dit soort problemen ontstaat echter alleen bij grote vervormingen van de 

ruimte. Bij kleine vervormingen bewegen de spinoren wel netjes mee. We strijken 
daarom over ons hart en reiken de troostprijs uit: spinorbundels mogen dan niet 
'natuurlijk' zijn, ze zijn ten minste nog wel 'infinitesimaal natuurlijk'. 

Een spinorveld is een snede van de spinorbundel. Het wijst in ieder punt van 
de ruimte M een spinor aan, de waarde van het veld in dat punt. Spinorvelden 
wordcn vcel gebruikt in de natuurkunde. Je zou zeggen dat de rare flip van J, 
naar ^ onder een rotatie van 360° problemen oplevert, maar dat blijkt mee te 
vallen. De reden hiervoor is wat subtiel; het komt er uiteindelijk op neer dat een 
spinorveld nooit direct kan worden waargenomen, we zien alleen zijn kwadraten. 
En in het kwadraat van een spinorveld heb je twee flips, die elkaar ophefFen. 

Het is ook zeker niet zo dat spinorvelden alleen gebruikt worden om bi- 
zarre efFecten te beschrijven in obscure hoekjes van de moderne fysica. In het 
'standaard model', de natuurkundige theoric die al sinds de jarcn '70 vrijwcl al- 
gemeen geaccepteerd wordt, wordt alle huis-tuin-cn-kcukcmriaterie beschreven 
door spinorvelden. Net als licht een trilling is in het elektromagnetisch veld, zijn 
sommige elementaire deeltjcs trillingen in een spinorveld. Twee van dat soort 
deeltjes, de elektronen en de quarks, klonteren via een aantal tussenstappen aan- 
een tot atomen. Atomen op hun beurt zijn weer de bouwstenen van alle materia 
om ons heen. Welbeschouwd is dus iedere steen, iedere appel, iedere hond, mens 
of neushoorn een serie trillingen in een groot spinorveld. 

De Hamvraag 

We hebben gezien dat verschillende soorten velden worden beschreven door ver- 
schillende soorten vezelbundels. Een vezelbundel komt (X'litcr alleen in aanmer- 
king om een veld te beschrijven als hij infinitesimaal natuurlijk is. Zijn vezels 
moeten dan als het ware meebewegen met kleine vervormingen van de ruimte, 
zoals in figuur 36, 37 en 38. 

Dit laat de natuur relatief weinig vezelbundels om uit te kiezen. Als we ze 
allemaal kunnen beschrijven, dan geeft ons dit een idee welke velden wel in de 
natuur kunnen voorkomen, en welke niet. De hamvraag is dus: 

'Welke vezelbundels zijn infinitesimaal natuurlijk?' 

Het is natuurlijk niet zo dat, als een veld in de natuur mag voorkomen, hij 
dat ook automatisch doet. Het infinitesimaal natuurlijk zijn van zijn vezelbundel 
moet je dan ook zien als een minimumvereiste voor een veld om in de natuur 
voor te komen. 

Hoofdstuk 7 is gewijd aan het beantwoorden van de hamvraag. Het belang- 
rijkste resultaat is de classificatiestelling 7.14 op bladzijde 167. Zij zegt dat alle 
infinitesimaal natuurlijke vezelbundels afstammen van een overzichtelijke oerfa- 
milie, de zogenaamde 'universele overdekkingen van bundels van fc-frames'. 

Hoe die beesten er precies uitzien, is vrij goed bekend (zie sectie 7.4.3 voor 
een algemene beschrijving en figuur 39 voor een eenvoudig voorbeeld). Van 
belang is vooral dat het er niet al te vecl zijn, en dat hun structuur sterk af- 
hangt van de gladde varieteit M. Voor iedere M geeft ons dit een 'grijze' lijst van 
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velden die infinitesimaal natuurlijk zijn en dus in de natuur kunnen voorkomen, 

en een 'zwarte' lijst van velden die dat zeker niet kunnen. 

De grap is nu dat we eigenlijk al wel een heel aardig beeld hebben van de 
velden die in het wild voorkomen. De laatste decennia hebben fysici (CERN, 
Ferniilab etc.) enorme deeltjesversnellers gebouwd, waarniee een 'witte' lijst 
is opgesteld van velden die echt gemeten zijn, en die dus zeker in de natuur 
voorkomen. 

Aan de andere kant hebben we eigenlijk geen flauw benul van de vorm van 
onze 4-dimensionale tijdruimte M. Zij is op grote schaal recht voorzover wij 
kunnen zien, maar dat is helaas niet ver genoeg om uitsluitsel te kunnen geven. 




Fig. 39: Schets van de universele overdekking van de bundel van fc-frames 
over de cirkel, voor het geval A; = 1. 

We willen dus eigenlijk stelling 7.14 ondersteboven houden! In plaats van 
alle mogelijke velden te bepalen voor een gegeven tijdruimte M, bepalen we 
nu alle mogelijke tijdruimten, gcgcvcn do 'witte' lijst van velden waarvan we al 
weten dat ze voorkomen. Dit is do portcc van stelling 7.23 op bladzijdc 180. 

Een aantal mogelijkheden kun je nu al direct afstrepen. De gladde 4-dimen- 
sionale varieteit M met de naam CP^ valt bijvoorbeeld af, omdat zijn 'zwarte 
lijst' volgens stelling 7.14 overlapt met de 'witte lijst' van waargenomen velden. 
Als het bccld dat we hebben van de elementaire velden correct en compleet is, 
kan de tijdruimte M dus nooit van deze vorm zijn. 

Welbeschouwd is dit ccn wonderlijke conclusie: door goed to kijken naar 
de kleinst mogelijke structmcu, de elementaire deeltjes waaruit alle materie is 
opgebouwd, krijgen we informatie over de vorm van ons heelal. 
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